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Novel Compounds 



The present invention relates to recombinant he tero chimeric paramyxoviridae 
glycoproteins and their expression in eukaryotic cells, particularly in Chinese 
5 Hamster Ovary (CHO) cells. The invention further relates to methods for 

constructing and expressing such heterochimeric proteins, intermediates for use 
therein, methods to optimize the codon usage of the nucleic acid sequences which 
encode such heterochimeric proteins and the use of the recombinant proteins as 
vaccines for the prevention of diseases caused by paramyxoviridae pathogens. 

10 

The mumps (MuV), Measles (MV), the parainfluenza type I (PIV1), type II (PIV2) 
and type III (PIV3) and the respiratory syncytial (RSV) virus belong to the 
paramyxoviridae family. The MuV is classified in the nibulavirus subclass, the MV 
is classified in the Morbillivirus subclass, the parainfluenza viruses (PrVl, PIV2 
15 and PIV3) are classified in the paramyxovirus subclass while the RSV is attached to 
the pneumo virus subclass. 

RSV is the most important cause of viral lower respiratory tract disease in infants 
and children. The fusion (F) and the attachment (G) protein which are both viral 
20 surface glycoproteins appear to be of potential value for the development of a 
vaccine against RSV. 

The fusion protein F of RSV contains 574 amino acid residues; amino acids 1 to 21 
correspond to the signal peptide and residues 525 to 549 to the membrane anchor 
25 domain. The molecule presents five potential sites for glycosylation. The F protein 
is synthesized as a 70 kDa precursor (F 0 ) which undergoes proteolytic maturation to 
yield the F, subunit (48 kDa) and F 2 (23 kDa) linked via disulfide bridges. The 
protein F, when injected into animals, leads to the production of neutralizing 
antibodies and may induce cytotoxic lymphocytes (CTLs). 

30 

The attachment or G protein of RSV contains 298 amino acid residues and is 
heavily glycosylated since half of its molecular mass (90 kDa) is contributed by 
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oligosaccharide side chains, chiefly in the form of O-linked sugars. It has been 
shown that the G protein, when injected into animals » provides protection against 
homologous but not heterologous subgroup virus challenge. This protein is 
extremely variable and there is only a stretch of 13 amino acid residues which is 
5 conserved in all RSV. 



The PIV3 is second to RSV as a major agent of severe viral respiratory tract 
infections in infants. The fusion protein F of PIV3 contains 539 amino acid 
residues; amino acids 1 to 18 correspond to the signal peptide and residues 494 to 

10 516 to the membrane anchor domain. The molecule presents 4 potential sites for 
glycosylation. The F protein is synthesized as a 70 kDa precursor (F 0 ) which 
undergoes proteolytic maturation to yield the F x (56 kDa) and F 2 (14 kDa) subunits 
linked via disulfide bridges. The protein F, when injected into animals, leads to the 
production of neutralizing antibodies. The F protein is involved in cell fusion during 

15 viral infection and carries an hemolysin activity. Used alone for immunization, the 
F protein generates an immune response which is insufficient to confer protection 
against a challenge with the virus. Complete protection is only acquired by 
concomitant immunization with the attachment protein HN, another glycoprotein of 
PIV3. 

20 

The protein HN carries hemagglutinin and neuraminidase activities. It is composed 
of 572 amino acids; its membrane anchor domain occurs in the N-terminal end of 
the molecule, between amino acid residues 32 and 53. Four potential sites for 
-gi^oryhation'fraw 

25 an immune response and neutralizing antibodies. These antibodies however do not 
protect completely against a challenge with the virus. Full protection is obtained 
only by concomitant immunization with the F protein of PIV3 . 

The PIV1 virus was initially isolated from young children suffering from disorders 
30 of the lower respiratory tract. Infection with PIV1 causes the majority of cases of 
croup found for all infections caused by paramyxoviruses. Viral transmission of 
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PIV1 is by person to person contact or by aerosol, although the virus does not 
persist in the environment for long. 

Like PIV2 and PIV3, the PIV1 virus has two surface glycoproteins, the fusion 
5 protein (F) and the attachment protein (HN). These two proteins are the priority 
targets for the development of a subunit vaccine, the properties of which would be 
to ensure protection of children from the very first months of life and to prevent 
reinfection, or at least to prevent the serious complications by restricting viral 
development to the upper respiratory tract where the consequences would be benign 
10 (common cold) . 

PIV2 also affects very young children and causes the same type of respiratory 
disorders, essentially croup, but of less severity. The PIV2 virus has two surface 
glycoproteins (F and HN), which are potential targets for the development of a 
15 subunit vaccine. 

The measles virus is an extremely contagious agent which establishes itself in the 
epithelial cells of the respiratory tract, the oropharynx or the conjunctiva. The 
infection causes fever, cough, head-cold, conjunctivitis and a characteristic 
20 generalised rash. 

There is no appropriate inactivated vaccine against measles but an effective 
attenuated live vaccine is available and is generally used in combination with the 
attenuated live vaccines against rubella and mumps. ThMve -vaccine protects 

25 against the disease for at least 20 years. The measles virus has two surface 

glycoproteins, which are potential targets for the development of a subunit vaccine. 
The fusion protein (F) is a 550 amino acid. long glycosylated molecule and, as for 
the other paramyxovirus, has to undergo proteolitic cleavage to yield F, and F 2 
subunits that are linked via disulfide bridges. This molecule, which carries a 

30 haemolysin activity, generates an.immune protective response when injected into 

animals. The attachment protein (H), is a 617 amino acid long glycosylated protein, 
which carries a hemagglutinin activity. This protein leads, when injected into . 

-3- 
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animals, to the production of neutralizing antibodies that are able to inhibit 
hemagglutination. This immune response protects the animal against a viral 
challenge. 

5 The mumps virus is a pathogen causing the contagious infantile illness which 
consists of the inflammation of parotid glands. During the incubation period 
following infection, the virus replicates in the respiratory epithelium then 
disseminates into secretary ducts of the parotid glands. Other glands may become 
infected thereafter and numerous cases of meningitis have been reported. Among 
10 complications related to the infection, encephalitis is a serious one, with a mortality 
rate of about 1%; deafness cases have also been reported. 



A vaccine against mumps is available: it is made of an attenuated live virus, 
produced by culturing infected embryonic chicken cells. The vaccine leads to the 
15 seroconversion in vaccinated individuals and protects against infection in more than 
95% of seronegative persons. The vaccine thus reduces significantly the frequencies 
of complications. 

In a number of cases, however, viral infection is not detected because the effects 
20 remain subclinical. Young children and aged people are most likely to develop 

complications from mumps infection. In view of the inherent risks related to the use 
of attenuated live vaccines, such as the potentiation of the illness upon natural 
surinfection in vaccinated individuals, it is desirable to improve the safety of the 
vaccine ,-partrcalarly f or r the 'groups at risfc . 

25 

The fusion protein F of mumps virus contains 538 amino acid residues; amino acids 
1 to 26 correspond to the signal peptide and residues 483 to 512 to the membrane 
anchor domain. The molecule presents 7 potential sites for glycosylation. The F 
protein is synthesized as a 65-74 kpa precursor (F 0 ) which undergoes proteolytic 
30 maturation to yield the F f (58-61 kDa) and F 2 (10-16 kDa) subunits linked via 
disulfide bridges. The protein F is involved in cell fusion during viral infection, 
carries an haetnolysin activity and plays a role for viral penetration into cells. It 
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does not however carry the antibody dependent cellular cytotoxicity (ADCC) as 
observed for another mumps virus glycoprotein, HN. 

The protein HN (molecular weight 74-80 kDa) carries hemagglutinin and 
5 neuraminidase activities which are involved in virus attachment to cells and in the 
disruption of the host cell membranes. Protein HN (attachment protein or 
hemagglutinin-neuraminidase) generates neutralizing antibodies and appears 
important for the development of ADCC. Protein HN is composed of 582 amino 
acids; it carries a N-terminal anchor domain (residues 33 to 52) and 9 potential sues 
10 for glycosylation. 

For the viruses considered above, it appears that concomitant immunization with 
both membrane glycoproteins F and HN, or G in the case of RSV. are required to 
achieve full protection in the animal model. Chimeric proteins containing both the F 
15 and G proteins of RSV. or the F and HN proteins of PIV3 have shown complete 
protection against RSV or PIV3 challenge in cotton rats (Brideau et al, J Gen Virol. 
1989, 70 2637-2644 and Brideau et al. J Gen Virol, 1993. 74. 471-477). 

W093 14207 (Connaught) describes heterochimeric proteins comprising RSV and 
PIV3 proteins including F(RSV)xHN(PIV3) and F(PIV3)xG(RSV) hybrids, and 
suggests that such proteins can be expressed from a variety of host cells including 
bacterial, mammalian, insect, yeast and fungal cells. The specific examples 
describe expression in insect Sf9 and High 5 cells and mammalian Vero cells. 
There is no specific disclosure of the use dFCHO cells: Tne use -of 
cells is also described by Du et al. BIO/TECHNOLOGY 12.1994. 813-818. 



20 



25 



30 



Homa et al (Upjohn), J Gen Virol, 1993. 74. 1995-1999 describes another 
heterochimeric protein. F(RSV)xHN(PIV3) expressed in insect cells using a 

recombinant baculovims. 

Homochimeric paramyxoviridae glycoproteins have also been described by several 
workers :- 
-5- 
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WO8905823 (Upjohn) describes RSV FxG and GxF hybrids which can be expressed 
from bacterial, yeast, mammalian and insect cells. Example 7 describes the 
expression of an RSV FxG protein from CHO cells although there are no details of 
5 how successful such expression is. 

WO8910405 (Upjohn) describes PIV3 FxHN and HNxF hybrids which can be 
expressed from bacterial, yeast, mammalian and insect cells. Example 6 describes 
the expression of a PIV3 FxHN protein from CHO cells, however no details are 
10 given quantifying the extent of expression and secretion. 

Lehman et al (Upjohn), J Gen Virol, 1993, 74, 459^69 describes the expression of 
PIV3 FxHN in insect cells using recombinant baculovirus vectors as well as in CHO 



15 



cells. 



WO9306218 (SmithKline Beecham Biologicals) describes PIV3 FxHN hybrids 
which can be expressed in eukaryotic cells including vaccinia, CHO or Vero cells. 
Example B)2 describes the expression of a FsVxHNa hybrid in CHO cells and 
indicates that the product was almost evenly distributed between cells and medium. 
20 No details are however given quantifying the extent of expression and secretion. 

WO9425600 (SmithKline Beecham Biologicals) describes MuV FxHN and HNxF 
hybrids which can be expressed in vaccinia, a mammalian cell (such as CHO) or a 
. bacterial cell. Examples B) 3 and 4 describe the expression of s*FHNa"xFa anii 
25 Fs + a"xHNa" in CHO cells however no details are given describing the extent of 
expression and secretion. 

Although this cited art may suggest that homochimeric paramyxoviridae 
glycoproteins can be expressed in a variety of cell lines including CHO cells it has 
30 now been discovered that in fact expression and secretion from CHO cells is not 
always successful and success cannot be predicted. Thus it has now been 
demonstrated that although a RSV FxG hybrid could be successfully expressed and 

-6- 
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secreted in CHO cells, analogous homochimeric hybrids from PIV3 and MuV could 
not in fact be expressed in CHO cells in such manner that they could be purified 
from the supernatant in significant quantities. 

5 Surprisingly, it has now been discovered that heterochimeric hybrids can be 
successfully expressed and secreted in both CHO and insect cells. 

Accordingly in a first aspect the present invention provides a process for preparing a 
heterochimeric protein or an immunogenic derivative thereof comprising an 
immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV 
or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MuV or MV which process comprises expressing 
recombinant DNA encoding the heterochimeric protein or immunogenic derivative 
thereof in CHO cells and recovering the protein. 



10 



15 



By heterochimeric protein is meant one that does not contain a fusion or attachment 
protein from the same pathogen. 

This invention also provides novel heterochimeric proteins not previously described 
20 in WO 93 14207 which can be prepared using the process of the present invention. 

Thus, in a second aspect the present invention provides a heterochimeric protein or 
. immuno^nic derivative thereof comprising an immunogenic fragment of the 
fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic 
25 fragment of the attachment (G, HN or H) protein of RSV. PIV1, PIV2, PIV3, MuV 
or MV, with the proviso that where one of me immunogenic fragments is derived 
from RSV F, RSV HN or PIV3 F, PIV3 HN, the other of the immunogenic 
fragments is derived from MuV F, MuV HN, MV F. MV H. PIVl F.PIV1 HN, 
PIV2 F or PFV2 HN. 



30 



By an immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV is meant a part of the protein which contains at least one antigenic 



- 7- 
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determinant capable of raising an immune response specific to the F protein of 
RSV, PW1. PIV2, PIV3, MV or MuV respectively. Included within this definition 
is the full length F protein, preferably however the immunogenic fragment is 
lacking the membrane anchor domain at its C-terminal end. 

5 

By an immunogenic fragment of the attachment protein (G, HN or H) of RSV. 
PIV1 PIV2, PIV3, MuV or MV is meant a part of the protein which contains at 
least one antigenic determinant capable of raising an immune response specific to 
the G protein of RSV, to the HN protein of PIV1 , PIV2, PIV3 , MuV or the H 
10 protein of MV respectively. Included within this definition is the full length G or 
HN protein, preferably however the immunogenic fragment is lacking the 
signal/anchor domain at its N-terminal end. 

Preferably the heterochimeric protein is linked via an amino acid in the C-terminal 
15 part of the immunogenic fragment of the F protein of RSV, PIV1, PIV2, PIV3, 

MV or MuV to an amino acid in the N-terminal part of the immunogenic fragment 
of the G proteinof RSV, the HN protein of PIV 1 , PIV2, PIV3. MuV or the H 

protein of MV. 

20 Suitably the heterochimeric protein commences at its N-terminal end with a signal 
sequence from the F protein of RSV, PW1. PIV2, PIV3, MV or MuV. 
Conveniently this will be part of the corresponding immunogenic fragment of the F 
protein ,of RS V , PIV1, PIV2, PTV3, MV or MuV when this fragment is linked v/ a 
its C-terminal end to the N-terminal end of the immunogenic fragment 5Tth.ru 

25 protein of RSV, the HN proteinof PIV1, PFV2. PIV3, MuV or the H protein of 
MV. 

. Alternative signal sequences may also be employed. For example, the 
heterochimeric protein suitably commences at its N-terminal end with a signal 
30 sequence of tissue plasminogen activator (TPA) . 



-8- 
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In order to enhance the level of expression the heterochimeric protein may further 
comprise a ubiquitin leader sequence which is suitably positioned after any signal 
sequence as hereinbefore described. Preferably the ubiquitin leader sequence is 
linked to the C-terminal end of the signal sequence of TPA. 

5 

Preferably the ubiquitin leader sequence is derived from yeast, for example as 
described in Ecker et al, J.Biological Chemistry, 1988. 264(13). 7715-7719. 



10 



Suitably a cleavage site is positioned between the C-terminal end of the ubiquitin 
sequence and the N-terminal end of the immunogenic fragment of the F protein of 
RSV. PIV1, PIV2, PIV3, MV or MuV. 



In order to facilitate chromatographic purification the heterochimeric protein 
suitably comprises a polyhistidine tail, for example as described in Hochuli et al, 

15 BIO/TECHNOLOGY, 1988, 1321-1325. The polyhistidine tail preferably 

comprises from 2 to 6 adjacent histidine residues which is suitably attached at the C- 
terminal end of the heterochimeric protein. Preferably a cleavage site is positioned 
between the polyhistidine tail and the C-terminal end of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1, PIV2, PrV3, MuV or the H 

20 protein of MV. 

The cleavage site for the ubiquitin sequence and/or the polyhistidine tail may be 
chemical or enzymatic and preferably is an enterokinase cleavage site, for example 
as described in LaVallie et al. BIO^CHNOLOXJY , T993, W-TO. 

25 

Following expression and purification, treatment with an enterokinase will cleave 
off any ubiquitin and/or polyhistidine sequence releasing the desired heterochimeric 
protein. 

30. Particular heterochimeric proteins of this/invention include: 

the F protein of RSV lacking its membrane domain linked at its C-terminal end to 
the HN protein of MuV lacking its signal/anchor domain herein referred to as: 

- 9- 
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Fs + a RSVxHNs a MuV, as well as 
Fs + a PIV3 x HNs a- MuV; 
Fs + aMuV x GsaRSV; and 
FsV MuV x HNsaPIV3, and 
5 immunogenic derivatives thereof. 

The present invention also provides particular heterochimeric proteins which 

include: 

Fs + a'MuVxHsaMV; or 
10 Fs*aRSVxHNsaPIVl; or 

FsVRSVxHNs a PIV2, and 
imunogenic derivatives thereof. 

The present invention also provides heterochimeric proteins comprising RSV and 
15 PTV3 proteins not specifically disclosed in WO9314207, which advantageously can 
be expressed from CHO cells. 
These are: 

Fs + a (1-526) RSV x HNsa (70-572) PIV3; 
Fs + a (1-492) PIV3 x Gsa" (69-298) RSV; 
20 Fs + a (1-526) RSV x HNs a" (70-572) PIV3 bis; 

Fs + a (1-526) RSV x HNs a (70-572) PIV3 ent his, and 

sTPA (1-21) UB (1-74) ent Fs'a (24-526) x HN sa(70-572) PIV3, and 

immunogenic derivatives thereof. 



25 The heterochimeric proteins of the present invention are immunogenic. The term 
immunogenic derivative as used herein encompasses any molecule which is a 
heterochimeric polypeptide which is immunologically reactive with antibodies raised 
to the heterochimeric protein of the present invention or parts thereof or with 
antibodies recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the 

30 G protein of RSV, the HN protein of PIV1 , PIV2, PIV3 , MuV, the H protein of 
MV, the RSV virus, the PIV1 virus, the PI V2. virus, the PIV3 virus, the MV virus 
or the MuV virus, or which, when administered to a human, elicits antibodies 

- 10 - 
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recognising the F protein of RSV, PIV1, PW2. PIV3, MV or MuV. the G protein 
of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV, the H protein of MV, the 
RSV vims, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus or the 
MuV virus. In particular immunogenic derivatives which are slightly longer or 

5 shorter than the heterochimeric proteins of the present invention may be used. Such 
derivatives may, for example, be prepared by substitution, addition, or 
rearrangement of amino acids or by chemical modifications thereof including the 
coupling or for enabling the coupling of the heterochimeric proteins to other carrier 
proteins such as tetanus toxoid or Hepatitis B surface antigen. All such substitutions 

10 and modifications are generally well known to those skilled in the art of peptide 
chemistry. 

Immunogenic fragments of the heterochimeric proteins which may be useful in the 
preparation of vaccines may be prepared by expression of the appropriate gene 
15 fragments or by peptide synthesis, for example using the Merrifield synthesis (The 
Peptides, Vol 2., Academic Press, New York, p3). 

In a further aspect of the invention there is provided recombinant DNA encoding' 
the heterochimeric protein of the invention. The recombinant DNA of the invention 
20 may form part of a vector, for example a plasmid, especially an expression plasmid 
from which the heterochimeric protein may be expressed. Such vectors also form 
part of the invention, as do host cells into which the vectors have been introduced. 

In order to construct the DNA encoding a heterochimeric protein according to the 
25 invention, cDNA containing the coding sequences of the RSV, PI VI, PIV2, PIV3, 
MV or MuV fusion and attachment proteins and: optionally of the ubiquitin, 
polyhistidine. and enterpkinase cleavage sites may be manipulated using standard 
techniques, [see for example Maniatis T. et al M olecular Cloning* Gold Spring 
Harbor Laboratory, Gold Spring Harbor N.Y, (1982)} as further described 
30 hereinbelow. 



11 
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In another aspect of the invention there is described a process of enhancing the 
protein expression in mammalian cells by optimization of the codon usage of the 
nucleic acids transfected therein. Optimization of the codon usage involves the 
replacement of at least one non-preferred or less preferred codon in a natural gene 
5 encoding a heterochimeric protein by a preferred codon encoding the same amino 
acid. Highly mammalian-expressed genes have C or G at their degenerative 
position (third base in the codon) whereas the RSV or PIV3-prevalent codons have 
A or T. At least one codon, and more prefereably all the codons of the RSV or 
PIV3 protein can be changed to fit at best the human usage, that is, the one (or 
10 ones) that is the most prevalent as shown below. 



Ala: GCC 


Cys: TGC 


His: CAC 


Met: ATG 


Thr: ACC 


Arg: CGC 
AGG 
CGG 


Gin: CAG 


He: ATC 


Phe: TTC 


Trp: TGG 


Asn: AAC 


Glu: GAG 


Leu: CTG 


Pro: CCC 


Tyr: TAC 


Asp: GAC 


Gly: GGC 


Lys: AAG 


Ser: AGC 
TCC 


Val: GTG 



15 Each amino acid encoded by one of these codons are then considered humanised. 
The ratio between the number of humanised codons versus the total number of 
amino acids gives a percentage of humanisation as shown below. 



1) 


F RSV'(r-S26)orijiii»l 




140/526 = 


27% 


2) 


F RSV (1^423)humanisod 


+ (424-526)original 


403/526 = 


77% 


3) 


' F . RSV ( 1 -326)huoiinuc<! 




489/526 = 


93% 


4) 


P. RSV (l-526)originil 


-+■ HN piv3 (70-372) original 


258/1029 


= 25% 


5) 


F RSV<l-526)hum»nbcd 


+ HN iuvs (7<M72> origin»l . 


528/1029 


= 51% 


6) 


F RSV (l : 526Uiura»nijed 


+ HN pI y 3 (70-372) humuiied 




96% 



25 
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The invention also provides DNA encoding a heterochimeric protein or 
immunogenic derivative thereof in which the codon usage of one or more nucleic 
acids has been substantially optimised and a process for expressing said DNA in a 
CHO or insect cell. 

5 

There have been a number of reports that have described a substantial amelioration 
of protein expression in mammalian cells after re-engineering the nucleic acid 
sequence of the heterologous protein to fit the codon usage found in highly 
expressed human genes (Haas J., Park E-C. and Seed B., Codon usage limitation in 

10 the expression of HiV-1 envelope glycoprotein, Current Biology, 1996, 6, n°3, 
315-325 ; Kim C. H., Oh Y. and Lee T.H., Codon optimization for high-level 
expression of human erythropoietin (EPO) in mammalian cells, Gene 199, 1997, 
293-301 ; Zolotukhin S., Potter M. Hauswirth W.W. Guy J. and Muzyczka N. A 
Humanized green fluorescent protein cDNA adapted for high level expression in 

15 mammalian cells. J. of Virology, July 1996, 70, n°7, 4646-4654). 

Vectors comprising such DNA, hosts transformed thereby and the truncated or 
hybrid proteins themselves, expressed as described hereinbelow all form part of the 
invention. 

20 

For expression of the proteins of the invention, plasmids may be constructed which 
are suitable either for transfer into vaccinia virus or transfection into CHO cells, 
insect cells or Vero cells. Suitable expression vectors are described hereinbelow. 
Preferably 4he -p?3C£in£ of the presenv-invcstion ^£*rapress^4n^GKG i3^4nseci 
25 cells. 

For expression in vaccinia a vaccinia transfer plasmid such as pULB 5213 which is 
a derivative of pSCll (Chakrabati et al. Molecular and Cellular Biology 5, 3403 - 
3409, 1985) may be used. In one aspect the protein may be expressed under the 
30 control of the vaccinia P7.5 promoter. 
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For expression in CHO-K1 cells a glutamine synthetase (GS) vector such as pEE14 
may suitably be used so that the protein is expressed under the control of the major 
immediate early promoter of human cytomegalovirus (hCMV-MIE). Alternatively 
a vector which allows the expression of the coding module as a polycistronic 
5 transcript with the neo selection gene may suitably be used. In one preferred aspect 
the coding module is under the control of the Rous Sarcoma Long Terminal Repeat 
(LTR) promoter. 

Preferably the plasmid for expression in CHO-K1 cells carries a GS expression 
10 cassette suitable for gene amplification using methionine sulphoximine (MSX). 
Alternatively the plasmid for expression in CHO-K1 cells carries a DHFR 
expression cassette suitable for gene amplification using methotrexate (MTX). 

Preferably expression of the heterochimeric protein of the present invention is 
15 carried out in the presence of sodium butyrate and/or dimethyl sulphoxide (DMSO) 
which may enhance gene expression. 

For expression in insect cells a shuttle vector such as pAcUWS 1 or pAcGP67 may 
be used. In one aspect the protein may be expressed under the control of the 
20 baculovirus plO promoter or the polyhedrin promoter. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live 
-pscsmbinsnt *riras-or- bacterium ; inoculation ^and-m vivc infcrtiron *withniif s "hwvector 

25 will lead to in vivo expression of the antigen and induction of immune responses. 
Viruses and bacteria used for this purpose are for instance: poxviruses (e.g; vaccinia, 
fowlpox, canarypox), alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelian 
Equine Encephalitis Virus), adenoviruses, adeno-associated virus, picornaviruses 
(polio virus, rhinovinis), herpesviruses (varicella zoster virus, etc), Listeria, Salmonella, 

30 Shigella, BCG. These viruses and bacteria can be virulent, or attenuated in various 
ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention. 
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In yet another aspect of the invention there is provided a vaccine composition 
comprising a heterochimeric protein or immunogenic derivative thereof according to 
the invention in combination with a pharmaceutical^ acceptable carrier, a protein 
5 according to the invention for use in vaccinating a mammal and the use of a protein 
according to the invention in the preparation of a vaccine. 

Optionally, and advantageously, the vaccine of the present invention is combined 
with other immunogens to afford a polyvalent vaccine. In a preferred embodiment 
10 the heterochimeric protein is combined with other subcomponents of RSV t PIV1, 

PIV2, PIV3, MuV or MV, e.g. the single proteins F, G, HN or H or homochimeric 
proteins such as RSV FxG, PIV3 FxHN or MuV FxHN. 

In a particular aspect the invention further provides a vaccine composition 
15 comprising a protein according to the invention together with a suitable carrier or 
adjuvant. 

Vaccine preparation is generally described in New Trends and Developments in 
Vaccines, edited by Voller et al % University Park Press, Baltimore, Maryland, 
20 U.S.A., 1978. Encapsulation within liposomes is described, for example by 
Fullerton, U.S. Patent 4,235,877. 

In the vaccine of the present invention , an aqueous solution of the protein(s) can be 
- . -used -directly . *^erriati.vsly„, >tfce^rcteinv^ r ith<)F with2ut-pri^ 4^pl^lisatian,-Gaii 

25 be mixed, absorbed or adsorbed with any of the various known adjuvants. Such 
adjuvants include, but arc not limited to, aluminium hydroxide, muramyl dipeptide 
and saponins such as Quil A. Particularly preferred adjuvants are, MPL 
(monophosphoryl lipid A) and 3D-MPL (3 deacylated monophpsphoryl lipid A) [US 
patent 4,912,094], optionally formulated with aluminium hudroxide (EP 0 689 454) 

30 or oil in water emulsions (WO 95/17210). A further preferred adjuvant is known as 
QS21 which can be obtained by the method disclosed in US patent 5,057,540. Use 
of 3D-MPL is described by Ribi et al. in Microbiology (1986) Levie et al. feds) 
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Amer. Soc. Microbiol. Wash. D.C., 9-13. Use of Quil A is disclosed by Dalsgaard 
et a/.,(1977), Acta Vet Scand, 18, 349. Use of combined 3D-MPL and QS21 is 
described in WO 94/00153 (SmithKline Beecham Biologicals s.a). QS21 may be 
advantageously formulated with cholesterol containing liposomes, wherein 3D-MPL 
5 is present either in solution or incorporated in the membrane, as described in WO 
96/33739. 



As a further exemplary alternative, a heterochimeric protein of the invention or an 
immunogenic fragment thereof can be encapsulated within microparticles such as 

10 liposomes or associated with oil-in-water emulsions. Encapsulation within 
liposomes is described by Fullerton in US patent 4,235,877. In yet another 
exemplary alternative, a heterochimeric protein according to the invention or an 
immunogenic fragment thereof can be conjugated to an immunostimulating 
macromolecule, such as killed Bordetella or a tetanus toxoid. Conjugation of 

15 proteins to macromolecules is disclosed, for example by Likhite in patent 4,372,945 
and Armor et al. in US patent 4,474,757. 

The amount of the protein of the present invention present in each vaccine dose is 
selected as an amount which induces an immunoprotective response without 

20 significant, adverse side effects in typical vaccines. Such amount will vary 

depending upon which specific immunogen is employed and whether or not the 
vaccine is adjuvanted. Generally, it is expected that each dose will comprise 
l-1000jig of protein, preferably 1-200 jig. An optimal amount for a particular 
-vaccine can J be asctrtaineu r oy standard studies involving observation or antibody 

25 titres and other responses in subjects. 

The following examples and the attached figures (explained below) illustrate the 
invention. 



30 In the Figures: 

Figure 34A shows the impact of humanisation on the level of expression of FrHNp, 
where: 
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FhHNElO = product expressed by the pEE14FhHN transfected clone E10; 
FhHNE7 = product expressed by the pEE14FhHN transfected clone E7; 
FHNbis = product expressed by the pEEl4FHN transfected clone; 
4-but = 2mM Nabutyrate has been added to the cell medium, 3 days before 
5 harvest; 

pEE14 = negative control; 

Fdroso = pruified Fa- (drosophila derived); the standard protein in this ELISA 
assay wherein lul of standard corresponds to lng of product. 
Figure 34B shows humanisation impact on the level of expression of F^yHNp^, 
10 where the level of expression was determined by ELISA. Fdroso = purified Fa- 

(drosophila derived) that is the standard protein in this ELISA assay, lul of standard 
corresponds to lng of product. 
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EXAMPLES 

Example 1 

In order to vaccinate with a single immunogen, heterochimeric DNA molecules 
5 were constructed combining extracellular domains of the F and the attachment 
protein for each virus. DNA constructs for the PIV3 and MuV have already been 
described in WO9306218 and WO9425600, respectively. The DNA molecule 
combining the extracellular domains of the RSV F and G proteins were constructed 
as described below. 

10 

The DNA pieces were first inserted into the mammalian expression vector based on 
the replicon of the Semliki Forest Virus (pSFVl). This expression system does not 
lead to a stable expression mammalian cell line but, however gives an indication 
whether or not the chimeric protein is expressed and whether the product is 
15 effectively secreted in the culture medium, which is advantageous for the 
purification procedure. 

Stable expression in the culture medium of mammalian cell lines is preferred to 
obtain good quality and quantities of paramyxovirus glycoproteins. All the chimeric 

20 modules have been inserted in the shuttle vector, the pEE14, which integrates in the 
genome of mammalian cells such as CHO-K1. A quite good expression level was 
obtained with the RSV FxG homochimeric recombinant protein, however negligible 
expression was obtained for the FxHN recombinant homochimeric protein of either 
^PJy3&rMiiW,. ^Expression^fshetera^ was ^btaine^from--G«3 

25 cells. 

Thus by constructing heterochimeric DNA molecules combining the extracellular 
domains of the F protein of one virus linked to the extra cellular domain of the HN 
or G protein of another virus and inserting them into the pEE14 vector for CHO 
30 expression it has been possible to raise the expression level of these proteins. These 
proteins may be used to achieve protection against at least two paramyxoviridae 
viruses with a single immunogen. 
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Some of the chimeric molecules have been inserted into the shuttle vectors, 
pAcUWSl and pACGP67, which integrate in the genome of bacterial and 
lepidopteran cells. Surprisingly good expression of heterochimeric proteins was 
5 obtained from insect cells. 

Vector construction 
Preliminary Constructs 

10 

a) Plasmid pNIV2819 

Starting from plasmid pNIV2801, a cDNA clone encoding inter alia the F protein 
of RSV (type RSS-2; received from Dr Pringle, UK) we reconstructed a cDNA 
15 module coding for the F protein lacking the membrane anchor sequence. 

Plasmid pNIV2801 was digested with Pstl in order to recover a 1416 bp DNA piece 
encoding amino acid residues 18 to 489 of the F protein. Synthetic 
oligonucleotides, specifying respectively the sequences for amino acids 1 to 17 and 

20 490 to 526, were used to produce the corresponding cDNA fragments by the 
polymerase chain reaction performed with pNIV2801 DNA as template. The 
primers were designed to generate also unique flanking restriction sites useful for 
subsequent cloning steps. The coding module was assembled, by ligation, from the 
- <-hrse .DNA pieces-described -above ^krA -introduced -ntf o ^he^tantiaxti J dtmmg* "verctor 

25 pUC19, to create plasmid pNIV2819. This plasmid encodes the RSV F protein 
carrying its signal sequence but lacking its anchor sequence (figure 1). 

b) Plasmid pNTV 2820 

30 The cDN A. module encoding the full length F protein of RSV was constructed as 
follows. Using two synthetic oligonucleotides, the polymerase chain reaction was 
performed with pNTV2801 DNA as template to generate a 273 bp DNA fragment 
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encompassing the sequence coding for aa 490 to aa 574 of the F protein, the stop 
codon and unique restriction sites useful for subsequent cloning steps. This 
fragment was digested with Nstl and EcoRl and substituted for the Nsil-EcoRl DNA 
piece present in the coding module of pNTV2819 (figure 2). The resulting plasmid, 
5 pNIV2820, thus encodes the RSV F protein carrying both signal and membrane 
anchor sequences. 

c) Plasmid pNTV2841 

10 In this construction, the DNA coding for aa 165 to 176 of the G protein of RSV is 
fused to theT>NA encoding the RSV Fs + a" protein. This part of the G protein is 
conserved among both subgroups of RSV. 

The starting material, pNTV2819, was digested by Ncol and Smal yielding a 1601 
15 bp fragment. This fragment was subcloned into the Ncol and Mscl sites of 

pNTV103 (a derivative of pULB1221, see European Patent Application No. 186643) 
leading to pNIV2844. This subcloning allowed to place the translation initiation site 
of the F protein in a more favourable context according to the model proposed by 
Kozak (Kozak M, Nature 308, 241-246, 1984). 

20 

A 1605 bp fragment was recovered from pNIV2844 by digestion with Kpnl and Sail 
and introduced by ligation into pUC19 digested with Kpnl and Sail, creating 
pNIV2840. 

25 Two complementary synthetic oligonucleotides specifying the sequence for amino 
acids 165 to 176 of the G protein followed by a stop codon and flanked by Nsil, 
BamHi, EcoRl and Hindlll sites were hybridized. The 55 bp resulting fragment was 
cloned into the pNIV2840 digested by AWI.and Hindlll, thus replacing a 142 bp 
DNA sequence encoding amino acids 491 to 526 of the F protein. The resulting 

30 recombinant plasmid, pNIV2841, thus contains the sequence coding for amino acids 
1 to 490 of the F protein followed by amino acids 165 to 176 of the G protein 
(figure 3). 
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Vector Construction 

I) For transfer into the pSFVl vector 

5 a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN^v (60-582). 

Plasmid pNIV2875, a derivative of pNIV2820 which carries the DNA coding for 
10 the F protein of RSV in which the Spel restriction site has been eliminated by site- 
directed mutagenesis into the pUC19 vector, has been digested by Hindlll and 
AspHI, and a 1618 bp fragment has been isolated. Plasmid pNIV3229, a derivative 
of pNIV3215 whose construction has been already described in WO9425600 and 
which carries the DNA coding for the HN protein of MuV into the pUC19 vector, 
15 has been digested with Bbsl and BamHI; a 1580 bp fragment has been isolated. 
Both fragments were linked together by two complementary synthetic BspHI-Bbsl 
oligonucleotides (Fig 4A) restoring the coding sequence of the chimeric molecule 
and were inserted into the BamHl-Hindlll site of the pUC19 vector leading to 
pNIV4102. (Fig4B) After the sequencing of the junction regions, the chimeric 
20 cassette was retrieved from pNIV4102 by a BamYLI digestion and was inserted into 
the BarnUl site of the pSFVl vector (Liljestrbm, P. and Garoff,H. (1991) 
Bio/Technology 9, 1356). The resulting plasmid, pNIV4104, contains into the 
pSFVl vector the sequence coding for amino acids 1 to 526 of the RSV F protein 
^followed %*a23iina.acids 60 4e 5 82. cf -the AfcV-HM protein .-(Fag4Q 

25 

b) The RSV fusion protein lacking the membrane anchor domain fused to the 
PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain, F^v (1- 
526) HN PIV 3 (70-572). 

30 Plasmid pIBI-HN , a cDNA clone containing the complete coding sequence of 
protein HN of PIV3 as well as its 3* non coding sequence (received from Dr.K. 
Dimock, University of Ottawa, Canada)* has been digested by Asel and BamHl and 
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a 1468 bp fragment has been isolated. Plasmid pNTV2875 (see supra), which carries 
the DNA encoding the F protein of RSV, in which the unique Spel site has been 
eliminated by site-directed mutagenesis, inserted into the pUC19 vector, has been 
digested by BamHl and BspHl % and a 1588 bp fragment has been isolated. Both 
5 fragments were linked together by two complementary synthetic BspHl-Asel 

oligonucleotides (Fig5A) and were inserted into the BamHl site of the pUC19 vector 
leading to pNIV4105 or to pNIV4109 (FigSB) depending of the orientation of the 
chimeric module in the vector. After the sequencing of the junction region, the 
chimeric cassette was retrieved by a BamHl digestion from pNIV4109 and inserted 
10 into the BamHl site of the pSFVl vector. The resulting plasmid, pNIV4110, 

contains, inserted into the pSFVl vector, the sequence coding for amino acids 1 to 
526 of the RSV F protein followed by amino acids 70 to 572 of the PP/3 HN 
protein. (FigSC) 

15 c) The FTV3 fusion protein lacking the membrane anchor domain fused to the 
RSV attachment protein lacking the signal-anchor domain, F PrV r, (1-492) G^y 
(69-298). 

Plasmid pNIV3310, described in WO9306218 which carries the DNA coding for 
20 amino acids 1 to 484 of the PIV3 F protein followed by amino acids 87 to 572 of 
the PIV3 HN protein into the pIBI vector, was digested by EcoRl and Bglll, and a 
1435 bp fragment has been isolated. Plasmid pNIV2850, which carries the RSV G 
protein into the pUC19 vector, has been digested by Maelll and HindRl, and a 694 
bp Yragmenfhas r oeen isolated/Both Ifragments were £heri7inkeQ togelherby using 
25 two complementary BgRl-Maelll synthetic linkers (Fig6A) and were inserted into 
the EcoRL-fJindlll sites of pUC19 vector leading to pNIV4103 (Fig6B). The 
chimeric module was then retrieved from the pUC19 vector by a BamHVHindlll 
digestion. After treating the protruding ends with the Klenow polymerase, the 
chimeric cassette has been inserted into the Smal site of pSFVl vector. The 
30 resulting plasmid pNIV4106, thus. contains the sequence coding for amino acids l.to 
492 of the F protein of PIV3 followed by amino acids 69 to 298 of the G protein of 
RSV inserted into the pSFVl vector (Fig6C). 

-22- 



DEC 04 2000 18=34 



PAGE . 25 



WO 00/18929 



PCT/EP99/07004 



d) The PIV3 fusion protein lacking the membrane anchor domain linked to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, F Prv3 (1- 
493) HN MuV (60-582). 

5 

Plasmid pNIV3310 (see supra, FHN PIV3 in pEBI) was digested by EcoRI and BgUl 
and a 1435 bp fragment was isolated, Plasmid pNIV3229 (see supra, HN MuV into 
pUC19) was digested by Bbsl and Hindlll, and a 1610 bp fragment was isolated. 
Both fragments were linked together by adding two synthetic complementary linkers 

10 specifying a Bglll and a Bbsl ends (Fig7A) into the pUC19 vector leading to 

pNIV4117 (Fig7B). After sequencing the junction region, the chimeric cassette was 
retrieved from the pUC19 vector by a BamRl digestion and was inserted into the 
BamHl site of the pSFVl vector. The resulting plasmid pNIV4118 encodes, cloned 
in the pSFVl vector, the DNA sequence specifying amino acids 1 to 493 of the 

15 PIV3 fusion protein linked to amino acids 60 to 582 of the MuV HN protein 
(Fig7C). 

e) The MuV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F^y (1-482) Grsv 

20 (69-298). 

Plasmid pNIV3221, described in WO9425600 which carries the sequence encoding 
amino acids 1 to 462 of the MuV fusion protein within the pUC19 vector, has been 
, digested wi&:£caRI and ,.bp feagment hes heen^purified. -Placid 

25 pNIV3221 has been also digested with BsrFl and ft* I, arid a 628 bp fragment has 
been isolated. Plasmid pNIV2850 (see supra, into the pUC19) has been 
digested with Matffl and Hind/// and a 694 bp fragment has been isolated. The 
three fragments were linked together; the F^v/Grsv junction was created by adding 
to the ligation reaction two synthetic complementary oligonucleotide specifying Pstl 

30 and Afaelll sites (Fig8A), and were inserted into the EcdRl Hindill sites of the 
pBluescript vector leading to pNIV4113(Fig8B). The chimeric cassette was 
recovered from pNTV41 13 by a Asp718l digestion and, after treating the protruding 
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ends with the Klenow polymerase, was inserted into the Smal site of the pSFVl 
vector. The resulting plasmid, pNIV4114 contains into the pSFVl vector the 
sequence specifying amino acids 1 to 482 of the MuV F protein linked to amino 
acids 69 to 298 of the RSV G protein (Fig8C). 

5 

f) The MuV fusion protein lacking Us membrane anchor domain linked to the 
PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F MuV (1- 
482) HN Prv3 (54-572). 



10 Plasmid pNIV4113 (see supra, F MuV x G^v in pBluescript) was digested by Bsal and 
BamHl, a 1469 bp fragment was isolated. Plasmid pNIV3308, described in 
WO9306218 and which carries the DNA sequence specifying amino acids 1 to 31 
followed by amino acids 54 to 572 of the PIV3 HN protein into the pIBI vector, 
was digested by EcoRl and BamHl and a 1569 bp fragment was isolated. Both 

15 fragments were linked together by two synthetic complementary linkers specifying 
Bsal and EcoRl sites (Fig9A) into the BamHl site of pBluescript leading to 
pNIV4115 (Fig9B). The chimeric module was recovered from pNTV4115 by a 
BamHl digestion and was inserted into BamHl site of pSFVl vector. The resulting 
plasmid, pNIV4116, encodes, in the pSFVl vector, the sequence specifying amino 
20 acids 1-482 of the MuV F protein fused to amino acids 54 to 572 of the PIV3 HN 
protein (Fig9C). 

g) The RSV fusion protein lacking its membrane anchor domain linked to the 
^RSV^attachme^^ ^igBc^-H-ntifaur domain, ^^(1-526)^,^(69- 

25 298). 

Plasmid pNIV2857 (Figl6A), a derivative of pNIV2841 and which contains the 
DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
30 and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted in the Smal 
site of the pSFVl vector. The resulting plasmid pNTV2870, contains in the pSFVl 
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vector, the DNA sequence coding for amino acids 1 to 526 of the RSV fusion 
protein linked to amino acids 69 to 298 of the RSV attachment protein (Figl6B). 

II) For transfection into CHO cells 

5 

a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN MuV (60-582). 

10 Plasmid pNIV4102. (FiglOA, see supra, F MV x HN MuV into the pUC19 vector) has 
been digested with fiamHI, and after treating the protruding ends with the Klenow 
polymerase, the chimeric module has been inserted into the Smal site of the 
glutamine synthetase (GS) vector, pEE14 (Cockett et al, 1990, Bio/Technology 8, 
662-667). The resulting plasmid pEE14 Fs + a RSV x HN s a' MuV contains 

15 sequences coding for amino acids 1 to 526 of the RSV F protein fused to amino 
acids 60 to 582 of the MuV HN protein under the control of the major immediate 
early promoter of the human cytomegalovirus (hCMV-MIE) (FiglOB). 

b) The RSV fusion protein lacking its membrane anchor domain linked to the 
20 PTV3 hemagglutinm-neuraminidase lacking its signal-anchor domain, Frs V (1- 

526) HN PtV3 (70-572). 

Plasmids pNIV4105 and pNIV4109 (FigllA and B, see supra, ¥ ww x HN PtV3 into 
lhe-pU'Ci9"^ector)-^re*dige3ted by -£ceRl "sad Xhol^znS. a 3G32 .bp as AV«U-a£*2 
25 1064 bp fragments were isolated. Both fragments were inserted together into the 
EcoW site of pEE14. The resulting plasmid pEE14 Fs + a RSV x HNs'a' PIV3 
contains sequences coding for amino acids 1 to 526 of the RSV F protein fused to 
amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (Fig 1 1 C) . 

30 



- 25 - 



DEC 04 2000 .18:35 



WO 00/18929 PCT/EP99/07004 

c) The PIV3 fusion pr tein lacking the membrane anchor region linked to the 
RSV attachment protein lacking the signal-anchor domain, F PrV3 (1-492) 
(69-298). 

5 Plasmid pNIV4103 (Figl2A, see supra, Fp^ x into the pUC19 vector) was 
digested by Hindlll and a 2180 bp fragment was isolated. After treating the 
protruding extremities with the Klenow polymerase, the chimeric module was 
inserted into the Smal site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a" 
PIV3 x Gs a'RSV, contains, under the control of the hCMV promoter, the sequence 
10 encoding amino acids 1 to 492 of the PIV3 F protein followed by amino acids 69 to 
298 of the RSV G protein (Fig 12B). 

d) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, Fp^ (1- 

15 493) HN MuV (60-582). 

Plasmid pNIV4117 (Figl3A, see supra, F^ HN MuV into the pUC19 vector) was 
digested with Hindlll and a 31 19 bp fragment was isolated and inserted into the 
Hindlll site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a PIV3 x HNs 
20 a" MuV, contains under the control of the hCMV promoter a sequence encoding 
amino acids 1 to 493 of the PIV3 fusion protein fused to amino acids 60 to 582 of 
the MuV HN protein (Figl3B). 

Tlie Mu^f^iGE ^ 

25 RSV attachment protein lacking its signal-anchor domain, F^v (1-482) Grsv 
(69-298). 

Plasmid pNTV4113 (Fig 14 A, see supra, F MuV G^y into the pBluescript vector) has 
been digested Asp 71 SI, the protruding ends have been treated by the Klenow 
30 polymerase. A 2200 bp fragment has been isolated and inserted into the Smal site of 
pEE14. The resulting plasmid, pEE14 Fs + a~ MuV x Gs'a'RSV, has, under the 
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control of the hCMV promoter, the sequence encoding amino acids 1 to 482 of the 
MuV F protein followed by amino acids 69 to 298 of the RSV G protein (Figl4B). 



f) The MuV fusion protein lacking its membrane anchor domain fused to the 
5 PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F MuV (1- 
482) HN Prv3 (54-572). 

Plasmid pNIV4115 (Figl5A, see supra, F MuV x HN prV3 into the pBluescript vector) 
has been digested with EcoRl and a 3040 bp fragment has been inserted into the 
10 EcoRI site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a MuV x HNs a 
PIV3, contains, downstream to the hCMV promoter region, a sequence coding for 
amino acids 1 to 482 of the MuV F protein followed by amino acids 54 to 572 of 
the PIV3 HN protein (Figl5B). 

15 g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F^y (1-526) 0^(69- 
298). 

Plasmid pNIV2857 (Figl7A), a derivative of pNIV2841 and which contains the 
20 DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted the Smal site 
cf-the-pEEMvector. The-resuliing -plasmid; pEE i'4 i^VRSV x* r Gs a TlSV, contams 
25 under the control of the hCMV promoter the DNA sequence coding for amino- acids 
1 to 526 of the RSV fusion protein linked to amino acids 69 to. 298 of the RSV 
attachment protein (Figl7B). 

h) The original RSV fusion protein lacking the membrane anchor domain 
30 linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, F wv (1-526) HN Piy3 (70-572) bis. 
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Plasmid pNIV2852, a derivative of pNTV2820 which carries the DNA encoding the 
RSV F protein where the translation initiation site is in a more favourable context 
according to the model proposed by Kozak (Kozak M, Nature 308, 241-246, 1984), 
has been digested BamHI and BspHI, and a 1588 bp fragment has been isolated. 

5 

Plasmid pIBI-HN, a cDNA clone containing the complete coding sequence of the 
HN protein of PIV3 (received from Dr. K. Dimock, University of Ottawa, Canada) 
has been digested by Asel and BamHI and a 1468 bp has been isolated. 

10 Both fragments were linked together by two complementary synthetic BspHI-Asel 
adaptators (Fig 18 A) and were inserted into the BamHI site of the pUC19 vector 
leading to pNIV4120 (Figl8B). 

After the sequencing of the junction region, the chimeric cassette was retrieved by a 
15 BamHI digestion from pNIV4120 and inserted into the BamHI compatible Bell site 
of the pEE14 vector. The resulting plasmid pEE14 Fs + aRSV x HNs'a" PIV3 bis 
contains the sequences coding for amino acids 1 to 526 of the RSV F protein fused 
to amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (Figl8C). 

20 

This construct differs from the earlier pEE14 Fs + a"RSV x HNs a~ PIV3 construct 
(Il-a) in the F coding region. In FrsvHNp^ bis, the nucleic acid sequence found in 
F R5V HN PIV3 , ATG GAT CTG (those codons are specifying aa Metl, Asp2 and Leu3) 
. . smd^ACC^GT Xspeca jying.aa .J&ES&aad .Ser ~55j-is ^placed- J3y>Jhe^xigMi?J 
25 sequence of the RSV F protein that is ATG GAG TTG (specifiyng aa Metl , Glu2, 
Leu3) and- ACT AGT (specifying Thr54 and Ser55). 

i) The original RSV fusion protein lacking the membrane anchor domain linked 
to the PTV3 hemagglutinin-neuraminidase lacking the signal-anchor domain 
30 with, at the C-terminal part, a poiyhistidine tail preceded by the enterokinase 
cleavage site, (1-526) HN PIV3 (70-572) en his 
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Plasmid pIBI-HN, a cDNA containing the PIV3 HN protein coding sequence (see 
supra) has been digested by PstI and Sphl. A 4588 bp fragment has been isolated 
and linked to complementary synthetic Pstl-SphI adaptators (Figl9A). 

5 After the sequencing of the junctions as well as the synthetic linkers, the resulting 
plasmid pNIV3340 has been digested by Xhol and BamHI and a 1 121 bp fragment 
has been isolated (Figl9B). 

Plasmid pNIV4120 (see supra) has been digested by Xhol and BamHI and a 2017 bp 
10 fragment has been isolated (Figl9C). 

Both fragments were linked together and inserted into the BamHI compatible Bell 
site of the pEE14 vector. The resulting plasmid pEE14 FRSVs + a" x HNs a en his 
contains, under the control of the hCMV promoter, sequences coding for amino 
15 acids 1 to 526 of the RSV fusion protein fused to the amino acids 70-572 of the 
PIV3 HN protein fused to the enterokinase cleavage site, ({Asp} x4 Lys) followed 
by a polyhistidine tail ({his}x6) and a stop codon (Figl9D). 

20 j) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the 
original RSV fusion protein lacking its membrane signal and anchor domains 
linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
.^mainrsWA^2^-lJB<>I-^) VEt-F^ {24*S2fi) iJN PIV 3 f7G^572)tis. 

25 

1) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin. 

A 208 bp fragment corresponding to amino acid 1 to 76 of the ubiquitin protein of 
30 Saccharornyces cerevisiae was isolated by a digestion of pNIV3475 ( a derivative of 
YEPUBSTUALL, a yeast 2 vector backbone carrying the yeast ubiquitin) with 
BamHI and Xbal (Fig 20 A). 
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Plasmid JW4304 (received from J. Mullins, University of Washington, U.S. A) 
which encodes the signal domain of the tissue plasminogen activator (sTPA) was 
digested by Nhel and BamHI and a 5115bp was isolated. Both fragments were 
5 linked together using two synthetic complementary Nhel-Xbal adaptators (Fig20B). 
The resulting plasmid pNTV4121 was digested by Hindlll and BamHL A 330 bp 
fragment was isolated and inserted into the Hindlll and BamHI sites of the 
pBluescript vector. The resulting plasmid pNIV4122 contains the DNA sequence 
specifying the signal domain of the tissue plasminogen activator followed by an 
10 alanine and a serine residue (those two amino acids are known to produce a good 
leader cleavage) fused to the yeast ubiquitin (Fig 20C). 

2) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and amino acid 
15 24 to 55 of the original fusion protein of RSV. 

Plasmid pNIV4122 (Fig 21 A, see supra) was digested by AflH and Spel. A 3212 bp 
fragment was isolated and linked to synthetic complementary Aflll-Spel adaptators 
(Fig21B). The entire module was then sequenced. The resulting plasmid pNIV4123 
20 encodes the signal domain of the tissue plasminogen activator linked to the N- 

terminal 74 aa of the yeast ubiquitin followed by the recognition site of enterokinase 
{(Asp)4 Lys} and amino acid 24 to 55 of the original fusion protein of RSV 
(Fig21C). 

25 3) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the RSV 
fusion protein linked to the PIV3 hemagglutin-neuraminidase lacking their 
membrane domains. 

30 Plasmid pNIV4123 (Fig 22 A, see supra) was digested by Hindlll, treated by the 
Klenow polymerase and digested by Spel. A 408 bp fragment has been isolated. 
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Plasmid pNTV4120 (Fig 22B, see supra) has been digested by Xbal y treated by the 
Klenow polymerase, and digested by SpeL A 5620 bp fragment has been isolated. 

Both fragment have been linked together to generate pNIV4124 (Fig 22C). 

5 

The entire coding module was retrieved from pNIV4124 by a digestion with Xbal 
and EcoRI and was inserted into the Xbal and EcoRI sites of the pEE14 expression 
vector. The resulting plasmid pEE14 sTPA x UBI x EN x FsaRSV x HNs a PIV3, 
contains, under the control of the hCMV promoter, the sequence coding for aal-21 
10 of the tissue plasminogen activator followed by an alanine and a serine residue, by 
the 74 N-terminal amino acids of the yeast ubiquitin, by the recognition cleavage 
site of the enterokinase ({Asp}4 Lys), by aa 24-526 of the original RSV fusion 
protein and by aa 70-572 of the hemagglutin-neuraminidase of PIV3. 

15 III) For transfection into Insect Cells 

a) The original RSV fusion protein lacking the membrane anchor domain 
linked to the PiV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, (1-526) HN^ (70-572) bis. 

20 

Plasmid pNTV4120 (FIG 23 A) was digested by BamHI and a 3114 bp fragment was 
isolated and inserted into the BamHI site of the baculovirus transfer vector, 
pAcUWSl (PharMingen). The resulting plasmid pNTV4132 (Fig 23B) contains, 
usder iks -control ef the poly hedrin-preffBGter, the seq^nce xoding *fox ^airrino "acids 
25 1-526 of the RSV F protein fused to amino acids 70-572 of the PiV3 HN protein. 

b) The baculovirus gp67 signal peptide fused to the original RSV fusion protein 
lacking both membrane signal and anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, SGP67FRSV (25- 

30 526) HN W3 (70-572) bis. 
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Plasmid pNIV4120 (FIG 24A, see supra) was digested by BamHI and Spel and a 
2939 bp fragment was isolated, linked to two complementary synthetic BamHI-Spel 
adaptators and inserted into the BamHI site of the baculovirus transfer vector, 
pAcGP67A (PharMingen). The resulting plasmid pNIV4136 (Fig 24) contains, 
5 under the control of the polyhedrin promoter, the sequence coding for amino acids 
1-38 of the Baculovirus gp67 protein, followed by an Alanine and an Aspartate 
linked to amino acids 25-526 of the RSV F protein fused to amino acids 70-572 of 
the PiV3 HN protein. 

10 Expression in eukaryotic cells 

A) via the pSFVl vector 

The pSFVl vector is based on the Semliki Forest Vims (SFV) replicon. The DNA 
15 of interest is cloned into the pSFVl vector that serves as a template for in vitro 

synthesis of recombinant RNA. The RNA is transfected into mammalian cells such 
as BHK-21 cells. The recombinant RNA in the cells drives its own replication and 
capping resulting in production of heterologous protein. 

20 Plasmids pNIV2870 was digested with Pvul; pNTV4106, pNIV4110, pNIV41 14, 
pNIV4116 and pNIV4118 were digested with Spel prior to RNA transcription. 
After a phenol extraction followed by an ethanol precipitation, 2 pig of linearized 
DNA was used as a template for RNA production. About 5 /xg RNA was used to 
/ttassfe^ All -experiment^; 

25 procedures for RNA production and cell transfection are detailed in Liljestrom and 
Garoff (Bio/Technology, 1991, 9, 1356). 

After 24 h to 48 h post-electroporation, cells and spent culture medium have been 
collected for ELISA and radioimmunoprecipitation assays. 
30 a) P N1V41Q4, HN MuV 
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ELISA were done using mAb 2072 anti-HN MuV (Orvell, 1984, 7. Immunology 
132, 2622-2629) or 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to coat 
the microtiter plates and a rabbit polyclonal anti-SBL-1 (MuV) serum or mAbl9 
anti-F RSV (G.Taylor, Inst, of Animal Health, Compton Lab., U.K.) as capture 
5 antibody. 

Radioimmunoprecipitation of the 33 S-methionine labelled product was done using 
mAb2072 (Orvell) and products were resolved onto 7.5% SDS-PAGE. 

10 b) pNTV4110, Frsv HNp^ 

ELISA were done using anti-RSV goat serum 20RG45 or mAb anti-HN nv3 4830 
(Rydbeck et al, J. Gen. ViroL 67, 1531-1542, 1986) to coat microtiter plates and 
mAbl9 anti-F RSV (G.Taylor) or rabbit anti-PIV3 (E.Norrby, Stockholm) serum as 
15 a capture antibody. 

Radioimmunoprecipitation was done using anti-HN PIV3 mAb4830. 

c) pNTV4106, F PIV3 

20 

ELISA were done using mAb anti-F PIV3 4549 (E.Norrby, Stockholm) or mAb anti 
Grsv 858-2 (Chemicon, U.S.A.) to coat microtiter plates and a rabbit anti-PIV3 
serum as a capture antibody. 

25 Radioimmunoprecipitation was done using mAb anti-F PrV 3 3283 (Behringwerke). 

d) pNTV4118» F prv3 HN MuV 

ELISA plates were coated with anti-F PIV3 mAb 1031215 (Nonby) or with mAb 
30 2072 anti-HN MuV (Orvel) and rabbit anti-PIV3 sera or rabbit anti-MuV sera were 
used as capture antibody. 

Immunoprecipitation of labelled product was done using mAb 2072 anti-HN MuV. w 
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e) pNIV4114, F MuV x 

ELISA plates were coated with anti-F MuV monoclonal 5414 (Orvell) or anti 
5 mAb (Chemicon) and a rabbit ami-SBL-1 senim was used as a capture antibody. 

f) pNIV4116, F MuV x HNpiv3 

ELISA plates were coated with anti-F MuV mAb 5414 (Orvell) or mAb anti-HN 
10 PIV3 4830 (Norrby) and rabbit anti-SBL-1 serum or a rabbit anti-PIV3 serum as a 
capture antibody. 

g) pNIV2870, F RSV x 

15 ELISA were done using 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G.Taylor, Inst, of Animal 
Health, Compton Lab., U.K.) as capture antibody. 

20 B) Expression in CHO cells (stable transformants) 

All recombinant plasmids were transfected by calcium phosphate coprecipitation 
into CHO-KI cells, using 20 /ig DNA per 1.25 10 6 cells. The CHO-KI cells were 
jrzDwvJzi GMEMtS ^aaediusn, . The -GS -trans feotants -were fislsGted-by ^adding 25 

25 methionine sulfoximine to the culture medium two days after transfection. After ten 
to fourteen days, resistant colonies were picked and transferred into 96 wells plates. 
Each transformantwas then transferred into 24 wells plates and subsequently to 80 
cm 2 flasks. The GS transformants were assayed for the recombinant products when 
cells reached about 80% confluency. The procedure, follows the one described in 

30 Cockett et al (Bio/Technology, 1990, 5, 662-667). 
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ELISA and immunoprecipitation of radiolabelled products were done using the same 
procedures as the ones described above for the pSFVl system. 



Results 

5 
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Expression in Insect cells 

10 a) Expression in lepidopteran cells. 

The vector pAcUW51 is a shuttle vector for bacteria and lepidopteran cells. A 
heterologous protein coding sequence can be inserted downstream the baculovirus 
plO promoter or either downstream the polyhedrin promoter. 

15 

The pAcGP67 vector is a shuttle vector for bacteria and lepidopteran cells that 
contains the gp67 signal sequence upstream a multiple cloning site. A heterologous 
gene can be inserted in one of the cloning site and will be expressed as a gp67 
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signal peptide fusion protein under the control of the polyhedrin promoter. The 
gp67 signal peptide mediates the secretion of the recombinant protein. 

Either pAcUWSl or pAcGP67 recombinant plasmid can be transfected along with 
5 baculovinis linearised DNA into Sf9 cells (Baculogold DNA, PharMingen). This 
leads to the generation of a recombinant baculovinis stock. The expression of the 
recombinant heterologous protein is obtained by infecting insect cells with the 
recombinant baculovinis 

10 Plasmid pNIV4132 or plasmid pNIV4136 were transfected with baculovinis linearised 
DNA into Sf9 cells. Recombinant baculovinis 3546 (derived from cells transfected by 
pNIV4132) or 5V (derived from cells transfected by pNIV4136) were plaque purified 
and were used to infect Sf9 or High Five™ cells (Invitrogen). 24h to 72 h post- 
infection the cells and the spent culture medium have been collected for ELISA and 

15 Western blot analysis. 

ELISA were done using anti-RSV goat serum 20RG45 (Fizgerald) to coat microtiter 
plates and mAbl9 anti-F RSV (G.Taylor) as a capture antibody. 

20 Western blots were done using mAbl9 anti-F RSV (G.Taylor) or using anti-RSV 
goat polyclonal serum 20RG45 (Fizgerald). 

The spent medium from cells infected by either baculovinis 3546 or by 5V tested 
^positi ve. in .ELISA, JEhe Jewrf .of expression , ^ d^sndis-g-G-n -the im&i eeH <Iine"fSF9«xjr 
25 High Five), multiplicity of infection, medium (fetal calf serum supplemented or 
serum free synthetic medium) was at least ten times higher than the one obtained 
with a recombinant CHO-KI clone obtained by transfection with pEE14 F^ (1- 
526) HN PiV3 (70-572)bis . 

30 In addition, the spent , medium of the baculovinis infected cells reacted positively in 
Western blot. A band in the vicinity of 1 lOkDa was present in the immunoblots. These 
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results confirm the secretion of the chimeric Fr S v-HNp,-v3 into the medium of Sf9 and 
High Five cells infected with the recombinant baculoviruses. 



b) Purification of the recombinant product 

5 

SF9 cells, adapted to serum free medium, were infected with the plaque purified 
recombinant baculovirus V5 or 3546. The cells were grown in suspension in 500ml 
Erlenmeyer flask in SF900II medium (Gibco BRL). The medium from virus infected 
cells were harvested two days post-infection. The soluble FR S v-HN P iv3 product was 

10 purified from the medium of infected cells by immunoaffinity chromatography using an 
anti-F RSV monoclonal antibody, mAbl9. The anti-F monoclonal antibody was 
coupled to Activated CH Sepharose 4B (Pharmacia) following the manufacturer 
instructions. The immunoaffinity gel was washed 3 times with 10 bed volumes of 
buffer A (20mM phosphate buffer pH 6.4, NaCl 150mM) prior to sample loading. 

15 After 16 hours at 4°C, the gel was washed with buffer A and the chimeric product was 
eluted with lOOmM phosphoric acid. Eluted protein was neutralized immediately with 
one tenth of volume of 1M phosphate buffer pH 7. 

SDS-PAGE of the inununoaffinity-purified F RS v-HNpj V3 revealed the presence of a 
20 major protein band of about 1 10 kDa. This protein was visualized by Coomassie 

blue staining of the gel and reacted with the monoclonal antibody anti-F^y (mAbl9) 
or with the polyclonal serum (20RG45) on immunoblots (Fig25). 

c) ^producti-on-of ^polyclonal antiboxlies 

25 

In order to obtain specific antibodies, the baculovirus derived Frsv-HN P jv3 protein, 
purified by immunoaffinity as described above, was used to immunise four BalbC mice 
and two New Zealand white rabbits. Three sub-cutaneous injections of 
20(ig/ml/dose/rabbit or 6^g/100^il/dose/mouse were done at three weeks interval. The 
30 sera were collected 3 weeks after the second and the third injection and the antibody 
response was detected using ELISA and Western blots assays. 
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1) ELISA assays 

a) Mice response 

The antibody response was followed using a goat anti-RSV serum (20RG45, 
Fitzgerald, USA) to coat the microliter plates and mouse anti-FHN sera as capture 
5 antibody. The antigens used were either the Fa^y-Drosophila or CHO derived, the 
Fusv-HNpiva expressed in baculovinis and the medium of CHO cells transfected by 
the pEE14 was used as a negative control. 

3 our of 4 mice sera collected after the second injection showed some but low 
10 specific response. However, the mice sera collected after the third injection showed 
a high increase in level of specific antibodies. 

b) Rabbit response 

The antibody response was followed using either one of the following ELISA. The 
15 antigens were the same as the one used to detect the mice antibody response. 

Either a goat anti-RSV serum (20RG45, Fitzgerald, USA), either a monoclonal 
antibody directed against the RSV fusion protein (mAbl9, Compton Lab, UK) or a 
monoclonal antobody directed against the PiV3 hemagglutinin-neuraminidase 
(mAb3285, Behring) were used to coat the microliter plate and the rabbit anti-HN sera 
20 was used as a capture antibody. The first and the second test bleeds generated high 
specific antibodies 

2) Western blot assays 

Recombinant J&-RSV XHO-JG .ou JHrasophila derived; ,E^ V .?HN?W3 vbacuJo-wiis 
25 derived or the CHO-pEE14 spent medium culture were electrophoresed onto a 15% 
SDS-PAGE and transferred onto a nitrocellulose membrane (Amersham). The rabbit 
anti-HN sera as well as the mouse anti-HN sera detected specifically either the F 
protein or the FRsv-HNrw3 chimera. 



- 38 - 



DEC 04 2000 18:39 



PRGE.41 



WO 00/18929 



PCT/EP99/07004 



Example 2 

i) Optimization of the codon usage of the nucleic acids sequence coding for the 
RSV fusion protein lacking the membrane anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, (1-526) 
5 HN^ (70-572) for the expression in mammalian cells. 

A table showing the comparison of the codon usage found in the FrsvHNpjvb module 
with the one found in highly expressed human gene can be found in Fig. 26. As 
noted, the most prevalent codons found in the F^vHNnvi module have an A or a T 
10 at their third degenerative position, whereas the human prevalent codons have a C 
or a G. For the improvement of the F^yHNp^ protein expression, the entire coding 
sequence has been re-engineered to fit at best the human codon usage. The re- 
engineered sequence was obtained using synthetic long oligonucleotides, polymerase 
chain reaction (PCR) and conventional cloning procedures. 

15 

Re-engineering of the coding sequence of the F^HNpiVB module 
The entire synthetic sequence was recovered by joining three PCR fragments (A, B 
and C). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 27. It consists of assembling overlapping long oligonucleotides in 
20 a first round amplification. The resulting full size fragment is further amplified 
using two short primers located on each of its extremities. 

Construction of fragment A 

The first JR.CR fragment, corresponding to 18 bases encoding restriction sites 
25 followed by bases 1 to 1269 of the F^yHNp^ followed by 8 bases encoding 
restriction sites, was obtained by PCR assembly of 18 overlapping oligonucleotides 
(Fig 28). This fragment has been inserted in the pCRIITOPO cloning vector 
(Invitrogen). After sequencing the fragment, it was retrieved frqm the pCRIITOPO 
vector by a Xbal and BsrCI digestion and inserted into the corresponding sites of 
30 pNIV4120. The module corresponding to F^yHNp^ with bases 1 to 1264 
humanized was then retrieved by an Xbal and EcoRI digestion and inserted into the 
corresponding sites of pEE14 (Fig.29) generating pEE14xF KV kuniHNp iV3 . 

-39- 



DEC 04 2000 18M0 



PPGE.42 



WO 00/18929 



PCT/EP99/07004 



Construction of fragment B 

The second PCR fragment B corresponding to 13 bases encoding unique restriction 
sites followed by bases 1264 to 2136 of FrsvHNp^ was obtained by assembling 10 
oligonucleotides whose sequences can be found in Fig. 30. This fragment has been 
5 inserted in the pCRIITOPO vector and sequenced. This fragment has been 
recovered by a BsrGI and Kpnl digestion. 

Construction of fragment C 

The third PCR fragment corresponding to bases 2023 to 3090 followed by 6 extra 
10 bases encoding an EcoRI site has been assembled starting from the 15 
oligonucleotides shown in Fig 31. This fragment has been inserted in the 
pCRJITOPO cloning vector and sequenced. This fragment has been retrieved by a 
Kpnl and EcoRI digestion (Fig 31). 

15 Construction of the entire coding sequence 

The entire F^yHNp^j codon optimized coding sequence has been obtained by 
assembling fragment A, B, C as shown in Fig. 32. pNTV4120 in which the PCR 
fragment A has replaced the original sequence (see Fig. 29) was digested by BsrGI 
and EcoRI. The original sequence was eliminated and replaced by the BsrGI- Kpnl 

20 fragment B and the KpnI-EcoRI fragment C. The codon optimized module was 

retrieved from the PCRIITOPO vector by a Xbal and an EcoRI and inserted in the 
corresponding sites of the pEE14 vector. The resulting plasmid, pEEKF^ 
humHN PiV3 hum, encodes for the entire humanized coding sequence. The humanized 
"Fjtfv^Vivl^^to^a^^ Fig. 33. 

25 

Expression in CHO-KI cells 

The recombinant pEE 14 F^y humHN PiV3 (see construction of fragment A, above, or 

recombinant pEEMF^vhurnHNp^hum see construction of the entire coding 

30 sequence, above) was transfected using the FuGene reagent (Boeringer Mannheim), 

using 5 jig DNA per 1.25 10 6 cells. The CHO-KI cells were grown in GMEMrS 

medium: The GS transfectants were selected by adding 25 fiM methionine 

sulfoximine to the culture medium two days after transfection. After ten to fourteen 
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days, resistant colonies were picked and transferred into 96 wells plates. Each 
transformant was then transferred into 24 wells plates and subsequently to 80 cm 2 
flasks. The GS transformants were assayed for the recombinant product when cells 
reached about 80% confluency. The procedure follows the one described in Cockett 
5 et al (Bio/Technology, 1990, 8, 662-667). Alternatively, the expression was 

evaluated three to five days after the addition of sodium butyrate (2mM) in the cell 
culture. 

To compare the expression level to that of the non humanized FRsvHNp iV3 , ELISA 
10 assays were done, using 20RG45, a goat anti-RSV serum (Fizgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G. Taylor, Inst, of Animal 
Health, Compton Lab, U.K.) as capture antibody. The expression level was 
estimated using a purified Fa-Rsv expressed in the Drosophila system. 



15 The level of expression of the non-humanized expressed product by 

pEEMFRsvHNpiva didn't exceed 0.03 mg/L and 0.1 mg/L when sodium butyrate 
was added to the culture medium. The level of expression of the partially 
humanized product expressed by pEEHF^v humHN PiV3 , reached 1 mg/L and up to 
3 mg/L when sodium butyrate was added in the culture medium. The humanization 

20 of the sequence coding for amino acids 1-423 of the 1029 amino acids thus 
enhanced the level of expression up to 30 fold (see Figure 34a). 

The level of expression of the entirely humanized product expressed by pEEHF^y 
^wmHNp^iii^'V^Svat least -of 2 mg/L and reached^p^tG 5Q'm%fLwh€Ti^®3imn 
25 butyrate was added in the culture medium. The humanization of the entire coding 
region of F^yHNpivr, thus enhanced the level of expression of at least. 200 to 500 
fold (see Figure 34b). 

ii) Optimization of the codon usage of the nucleic acids sequence coding for the 
30 mumps virus (MuV) fusion protein lacking the membrane anchor domain 
linked to the measles virus (MV) lacking the signal-anchor domain, F Muv (1-482) 
H Mv (59-617) for the expression in mammalian, cells. 
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A table showing the comparison of the codon usage found in the F M uvHmv module with 
the one found in highly expressed human gene can be found in Fig. 35. As it can be 
seen, the codon usage frequencies of this chimerical gene is quite different from those 
5 prevalent in the human genome. For the improvement of the FmuvHmv protein 
expression, the entire coding sequence has been re-engineered to fit at best the human 
codon usage. The re-engineered sequence was obtained using synthetic long 
oligonucleotides, polymerase chain reaction (PCR) and conventional cloning 
procedures. 

10 

Re-engineering of the coding sequence of the F Mu vH M v module 

The entire synthetic sequence was recovered by joining four PCR fragments 
(A, B, C and D) The general strategy to obtain each PCR fragment is schematically 
represented in Fig 36. It consists of assembling overlapping long oligonucleotides in a 
15 first round amplification. The resulting full size fragment is further amplified using two 
short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 13 bases specifying restriction sites and a 
20 Kozak consensus motif followed by bases 1 to 1026 of the F^vHj^v was obtained by 
PCR assembly of 12 overlapping oligonucleotides (Fig 37). This fragment has been 
inserted in the pCRIITOPO cloning vector (Invitrogen). After sequencing the 
fragment, it was retrieved from the pCRIITOPO vector by a Xbal and TspRJ 
. . . ..digestion and;a:;963 Jhp ixagmenr Avas fuTti^^imnified^ leading^© fxagmecl-A 

25 

Construction of fragment B 

The second PCR fragment B corresponding to bases 965 to 1712 of FmuvHmv was 
obtained by assembling 9 oligonucleotides whose sequences can be found in Fig. 38. 
After its insertion into the pCRIITOPO vector and its sequencing, this 785 bp 
30 fragment has been recovered by a TspRI and Aval digestion. 
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Construction of fragment C 

The third PCR fragment C corresponding to bases 1712 to 2485 has been assembled 
starting from the 11 oligonucleotides shown in Fig 39. It has been inserted in the 
pCRIITOPO cloning vector and sequenced. This 774 bp fragment has been retrieved 
5 by an Aval and Apal digestion. 

Construction of fragment D 

The fourth PCR fragment D corresponding to bases 2485 to 3139 followed by 8 bp 
specifying a unique restriction site has been assembled starting from the 8 
10 oligonucleotides shown in Fig 40. This fragment has been inserted in the 
pCRIITOPO vector and sequenced. A 657 bp fragment has been recovered after an 
Apal and EcoRI digestion. 

Construction of the entire coding sequence 

15 The entire F MuV H Mv codon optimised coding sequence has been obtained by 
assembling fragment A, B, C, D and inserting the module digested by Xbal and 
EcoRI into the corresponding sites of the pEE14 vector (Fig. 41). The resulting 
plasmid, pEEMFMuyhumH^hum, encodes for a humanised sequence coding for aa 
1-482 of the mumps virus fusion protein followed by aa 59-617 of the measles 

20 virus. The humanised and original F^K^ nucleic and amino acids sequences are 
shown in Fig. 42. 

iii) Purification and analysis of FHN expressed in CHO-KI 

25 a) Purification 

CHO cell line expressing secreted recombinant FHN was cultivated in cell factories in 
G-MEM medium supplemented with 2% FCS, in presence or absence of 1% Butyrate 
Na. FHN was purified by immunoaffinity chromatography by loading spent culture 
medium onto aMabl9-sepharose column as described using the same experimental 
30 conditions. 
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When expressed in absence of Butyrate Na, purified FHN migrated on SDS-PAGE, in 
heating and reducing conditions, mainly as a band of 1 10 kDa. In contrast, FHN is 
visualized as a triplet of 1 10, 120 and 130 kDa when CHO cells are cultivated with 
butyrate. Heating has a more drastic effect than reducer on the FHN electrophoretic 
5 migration. Indeed, high molecular weight species are clearly detected in the 

preparation when electrophoresis proceeded without heating suggesting the presence 
of FHN aggregates or oligomers. These aggregates did not seem to be contaminated 
by CHO proteins. Antibodies directed to CHO proteins did not specifically recognize 
on Western blot any bands. Glycan analysis was performed using several lectins 
10 specific for different carbohydrate moieties. Surprisingly, FHN did not carry sialic 

acids or high-mannose structures but carbohydrates of galactose-acetyl-galactosamine 
type characteristic of hybrid N- and/or O-glycosylations, 

N-terminal microsequence analysis showed mainly the presence of Fl subunit in bands 
15 of 1 10-130kDa. The F2 N-terminal amino acid sequence detected in bands of lower 
and higher molecular weight indicated that some purified FHN molecules are present 
under a F0 form (non mature F) 

The presence of aggregates or oligomers in the FHN preparations was confirmed by 
20 gel filtration analysis and proteins were detected by laser-light scattering. Whatever the 
culture conditions (butyrate or not), between 50 and 65% of FHN populations 
displayed a molecular weight higher than 10 G Da demonstrating that FHN is 
aggregated. 5 to 15% has a molecular weight ranging from 400 to 900 kDa whereas 
30 to 35% is>mcnomeric FHN. 

25 

b) Serum immunoglobin analysis. 
Immunisation protocol 

The Fr S vHNp,-v3 protein was purified from the spent medium culture of the CHO-KI 

30 cells trahsfected by the recombinant pEE 1 4 FRsvhumHNp,-v3hum by immunoaffinity 

chromatography as described (Purification of the recombinant product expressed in 

baculovirus recombinant infected SF9 cells). The product was injected in 7 groups of 

Balb CI mice as descibed in the following table 1. 
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Humoral response directed against the FHN protein 

The humoral response directed against the FHN protein was determined. To this end, 
ELIS A plates were coated with immunoaffinity purified FHN protein. 

5 

Total IgG (Fig 43) 

To detect specific anti-FHN total IgG, ELISA plates were coated with 200ng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. Total IgG were detected using a biotinylated 
10 serum directed against mouse IgG. 

IgGl (Fig 44) 

To detect specific anti-FHN IgGl, ELISA plates were coated with lOOng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
15 mice second bleed sera were then applied, IgGl were detected using a biotinylated 
serum directed against mouse IgGl. 

IgG2a (Fig 45) 

To detect specific anti-FHN IgG2a, ELISA plates were coated with lOOng of 
20 immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. IgG2a were detected using a biotinylated 
serum directed against mouse IgG2a. 

The titer of-each sera was determined..and.a.mean titer for each group was calculated 
25 and is reported, in table 2. These experiments show that the FHN antigen by itself or 
formulated with adjuvant (group 1 to 3), stimulates a specific humoral response. 
Indeed, no anti-FHN antibodies are generated in the untreated mice group (group 5) or 
in the group immunised solely with the adjuvant (group 4). The group 1 (and group 4) 
adjuvant was 3D-MPL arid QS21 formulated with eholesterol containing liposomes as 
30 described in WO 96/33739; the group 2 adjuvant was alum. 

The IgGl/IgG2a ratio indicates the Thl or Th2 orientation of the immune response; 
(Table2), a protective response against both the RSV or the PiV3 should tend toward* 
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the Thl type, that is a low IgGl/IgG2a ratio. In this regard, the responses generated 
with the FHN formulated in the presence of the 3D-MPL + QS21 adjuvant appears to 
be the more promising one. 

5 Table 1: Experimental procedures 
Immunogenicity FHN in 
mice 



Group 


n 


Vol 
(ul) 


route 


Antigen 


Immuno- 
stimulants 


buffer 


preservative 


nature 


dose 
(M8) 


1 


12 


2x50 


IM 


FHN 


2 


3D-MPL/ 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


2 


12 


2x50 


IM 


FHN 


2 


Al(OH)3 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


3 


12 


2x50 


IM 


FHN 


2 


/ 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


4 


12 


2x50 


IM 


/ 


/ 


3D-MPL/ 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lUg/ml) 


5 


12 


/ 


/ 


untreated 


/ 


/ 


1 


/ 


6 


12 


2x30 


IN A 


RSV live 




/ 


1 


/ 


7 


12 


2x30 


INA 


PIV-3 live 




/ 


1 


/ 



*IM^ntra^4£3Ct3as" 
INA=intra-nasal 
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Antigen 


cc. jig/ml 


Buffer 


RSV live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 
pH 7.3 



Time schedule: 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed = Day 42 
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Antigen 


cc. ng/ml 


Buffer 


RSV live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 
pH 7.3 



Time schedule: 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed = Day 42 
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Table 2: Serum antibody response against FHN. 

The total IgG, IgGl and IgG2a was determined for each mouse sera. A mean titer for 
each group was then calculated and is reported in the table. 



group 
n° 


Immunogen 


Total IgG 


IgGl 


IgG2a 


IgGl/IgG2a 


1 


FHN + 3D- 
MPL/QS2 1 


1182000 


109800 


305500 


0.36 


2 


FHN + Alum 


182200 


127100 


4429 


28.7 


3 


FHN 


44990 


22760 


1941 


11.73 


4 


adjuvant=from 
group 1 


49 


32 


ND 


ND 


5 


untreated 


52 


ND 


ND 


ND 


6 


Live RSV 


12840 


748 


2718 


0.27 


7 


Live PiV3 


10860 


2758 


2320 


1.19 



ND==undetermined, the titer being to low 

5 
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Claims 

1 . A process for preparing a heterochimeric protein or an immunogenic 
derivative thereof comprising an immunogenic fragment of the fusion (F) protein of 
RSV, PIV1, PTV2, PIV3, MV or MuV and an immunogenic fragment of the 

5 attachment (G, HN or H) protein of RSV, PIV1, PIV2, PIV3, MV or MuV which 
process comprises expressing recombinant DNA encoding the heterochimeric 
protein or immunogenic derivative thereof in CHO cells and recovering the protein. 

2. A process according to claim 1 wherein at least one non-preferred or less 

10 preferred codon in a natural gene or DNA encoding the said heterochimeric protein 
or immunogenic fragment thereof has been replaced by a preferred codon encoding 
the same amino acid. 

3. A heterochimeric protein or an immunogenic derivative thereof comprising an 
15 immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV 

or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV, with the proviso that where one of the 
immunogenic fragments is derived from RSV F, RSV G or PIV3 F, PIV3 HN, the 
other of the immunogenic fragments is derived from MuV F, MuV HN, MV F, 
20 MV H, PIV1 F,PIV1 HN, PIV2 F or PIV2 HN. 

4. A process for preparing a heterochimeric protein or immunogenic derivative 
thereof as claimed in claim 3 which process comprises expressing recombinant 
DNA esoediag the ^hs££r^Mnseric»prateiE^r- h snmiffl^CQic "derivative cfeercsf in 

25 either one of; CHO cells or insect cells and recovering the protein. 

5. A protein according to claim 3 wherein the immunogenic fragment of the F 
protein is lacking the membrane anchor domain at its Oterminal end. 

30 6. A protein according to claims 3 or 5 wherein the immunogenic fragment of the 
G* HN or H protein is lacking the signal/anchor domain at its N-terminal end. 
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7. A protein according to any one of claims 3, 5 or 6 which is linked via an amino 
acid in the C-terminal part of the immunogenic fragment of the F protein of RSV, 
PIV1, PIV2, PIV3, MV or MuV to an amino acid in the N- terminal part of the 
immunogenic fragment of the G protein of RSV or the HN protein of PIV1 , PIV2, 

5 PIV3, MuV or the H protein of MV. 

8. A protein according to any one of claims 3 , 5, 6 or 7 which commences at its N- 
terminal end with a signal sequence from the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV. 

10 

9. A protein according to any one of claims 3,5,6 or 7 which commences at its N- 
terminal end with a signal sequence from TPA. 

10. A protein according to any one of claims 3 or 5 to 8 which comprises a 
15 ubiquitin leader sequence. 

11. A protein according to any one of claims 3 or 5 to 9 which comprises a 
polyhistidine tail. 

20 12. A protein according to claim 10 or 1 1 which comprises a cleavage site for 
cleaving off the ubiquitin leader sequence and/or the polyhistidine tail. 

13. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 

25 Fs+a-RSVxHNs aMuV; 

Fs + a* PIV3 x HNs a MuV; 
Fs+a MuV x GsaRSV; or 
FsVMuV x HNs'aPIV3, or 
an immunogenic derivative thereof. 



30 



14. A heterochimeric protein according to any one of claims 3 or 5 to 11 which is 
selected from the group consisting of: 
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Fs + a MuV xHsaMV; or 
Fs + aRSVx HNs aPIVl, or 
Fs + a RSVx HNs a'PIV2, or 
an immunogenic derivative thereof. 

5 

15. A heterochimeric protein which is: 

FsV (1-526) RSV x HNs* a' (70-572) PIV3, 
Fs + a(1^92) PIV3 x Gsa" (69-298) RSV, 
Fs + a (1-526) RSV x HNs'a (70-572) PIV3 bis, 
10 Fs*a (1-526) RSV x HNs'a" (70-572) PIV3 ent his, or 

sTPA (1-21) UB (1-74) ent Fs a' (24-526) x HN s a (70-572) PIV3, or 
an immunogenic derivative thereof. 

16. Recombinant DNA encoding a heterochimeric protein or an immunogenic 
15 derivative thereof according to any one of claims 3 or 5 to 15. 

17. Recombinant DNA according to claim 16 in which at least one non-preferred 
or less preferred codon in the DNA has been replaced by a preferred codon 
encoding the same amino acid. 

18. DNA which hybridises under conditions of high stringency with the DNA of 
20 claim 16 or 17. 

19. An expression vector comprising recombinant DNA according to claims 16 to 
18. 

20. A host transformed with DNA according to any one of claims 16 to 18 or with 
a vector according to claim 19. 

25 21 . A host according to claim 20 which is a CHO cell. 

22. A host according to claim 21 which is an insect cell. 
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23. A vaccine composition comprising a protein according to any one of claims 3 
or 5 to 13 or an immunogenic derivative thereof in admixture with a 
pharmaceutical^ acceptable carrier. 

24. A vaccine composition according to claim 23 further comprising 3D 
5 Monophosphoryi lipid A and/or QS-21. 

25. A vaccine composition according to claims 23 or 24 wherein the carrier is an 
oil-in- water emulsion. 

26. A heterochimeric protein or an immunogenic derivative thereof according to 
any one of claims 3 or 5 to 15 for use in medicine. 

10 27. A process for the production of a heterochimeric protein according to any one 
of claims 3 or 5 to 15 which process comprises expressing recombinant DNA 
encoding said protein or immunogenic fragment thereof in a host cell and 
recovering the protein. 

28. A method of treating a human or animal susceptible to paramyxoviridae viral 
15 infections comprising administering an effective amount of a vaccine according to 

any one of claims 23 to 25. 

29. Use of a protein or an immunogenic derivative thereof according to any one of 
claims 3 or 5 to 15 in the manufacture of a medicament for use in the treatment of 
respiratory disorders. 
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Fig. 1 



pNIV2819 



1/73 



Hindlll BamHI PstI 

T T 



t 

ATG 

PGR 64bp 
(aa 1-17) 



Fs + a 



aa 18-489 



Nsil BamHI 



pUC19 



STOP 

PCR 121bp 
(aa 490-526) 



<•• — 



1593 bp 
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Fig- 2 



2/73 



pNIV2820 

Hindi II BamHI 



t 

ATG 



Nsil 



BamHI EcoRI 







aa 409-526 





pUC19 



1 

STOP 



Fs + a- 



Substitution of the Nsil-EcoRI 
fragment of pNIV28 19 by the PCR- 
generated Nsil-EcoRI DNA piece 
corresponding to aa„ 0 - aaj 74 



Hindlll BamHI 



t 

ATG 



Nsil 



EcoRV EcoRI 



aa 409^574 



-pUC19 



I 

STOP 



FsV 
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Nsil BamHI 
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t 

STOP 
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Fig. 4 



A) Synthetic adaptators 

5' C ATG AAT GAT CAA GGC TTG AGC AA 3' 

TTA CTA GTT CCG AAC TCG TTA GTC 

BspHI BbsI 



[SEQ ID NO: 1] 



B)pNIV4102 
Hindlll BamHI 



ATG 



BspHI BbsI 

1 ! 



BamHI 



Frsv 1-526 




HN MuV 60-582 



-pUC19 



STOP 



C)pNlV4104 



BamHI 



I 



T 



ATG 



F RS v 1-526 



BamHI 




HN MuV 60-582 



4l_ P SFV1 



STOP 
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Fig. 5 



A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 
TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG GTT 

BspHI 

ATG GCA TCG GAT ATT AT 3' 

TAC CGT AGC CTA TAA TATA [SEQ ID NO: 2] 

Asel 



B)pNIV4109 

BamHI Asel BspHI BamHI 



amHI Asel tJspHl nanu 

I , ij I 

4_J HN Prv3 70-572 H Frsv 1-526 \± 




Frsv 1-526 P?^pUC 1 9 



STOP ATG 



G)pNIV4110 



BamHI 



BamHI 



1 



1-526 



1 




HN PIV3 70-572 



1 



+__ P SFV1 



ATG 



STOP 
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Fig. 6 



A) Synthetic adaptators 

5' GAT CTA GAA GAG TCA AAA GAA TGG ATA AGA AGG TCA AAT CAA 
AT CTT CTC AGT TTT CTT ACC TAT TCT TCC AGT TTA GTT 

Bgin 

AAA CTA GAT TCC ATT GGA AAT TGG CAT CAA TCT AGC ACC 3' 
TTT GAT CTA TGG TAA CCT TTA ACC GTA GTT AGT TCG TGG CAGT G 

Maein 

[SEQ ID NO: 3] 



B)pNIV4103 

BamHI 



ATG 



BgUI MaelU 



PIV3 



1-492 




J RSV 



69-298 



Hindlll 

ii_pUC19 



STOP 



C) pNP/4106 

Smal/BamHI 



F PIV3 1-492 




ATG 



Grsv 69^298 



HindlUlSm?! 

1 



±_pSFVl 



STOP 
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Fig. 7 



A) Synthetic adaptators 

5'G ATC TAG AAG AGT CAA AAG AAT GGA TAA GAA GGT CAA ATC 
ATC TTC TCA GTT TTC TTA- CCT ATT CTT CCA GTT TAG 

Bglll 

AAA AAC TAG ATT CCA TTG GAA ATT GGC ATC AAT CTA GCA CCA 
TTT TTG ATC TAA GGT AAC CTT TAA CCG TAG TTA GAT CGT GGT 



CAA ATG ATC AAG GCT TGA GCA A 3' 
GTT TAC TAG TTC CGA ACT CGT TAGTC 

Bbsl 



[SEQ ID NO: 4] 



B)pNIV4117 



Bamffl 



ATG 



prva 



1-493 



Bglll Bbsl 

I i 




BamHI 



HN MuV 60-582 



I 



-pUC19 



STOP 



C) pNTV4118 

BamHI 



ATG 



PIV3 



1-493 



BamHI 



HN M uv60-582 



i_pSFVl 



1 



STOP 
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A) Synthetic adaptators 

5' GAA TGC CGT TAA ATA CAT CAA GAG AGT AAC CAT CAA 

A CGT CTT ACG GCA ATT TAT GTA GTT CTC TCA TTG GTA GTT 
PstI 

CTC CAT CGG TCT CAG TAA GTT CTA AA 3' 

GAG GTA GCC AGA GTC ATT CAA GAT TTC AGT [SEQ ID NO 

Maelll 



B)pNIV4113 

Asp718I 



F MuV l-482 



ATG 



PstI Maeni 

i I 




Grsv 69-298 



Asp718I 

▼ pBluescript 



STOP 



e)pNIV4114 



Smal/Asp718I 



F MuV l-482 



Asp718I/Smal 




Grsv 69-298 



▼—pSFVl 



ATG 



STOP 
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Fig- 9 



A) Synthetic adaptators 

5' GTAAGTTCTAAA 3' 

CAAGATTTTTAA [SEQ ID NO: 6] 
Bsal EcoRI 



B)pNIV4115 



BamHI 
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HN PIV3 54-572 
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STOP 
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Fig. 10 
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Fig. 12 
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Fig. 13 
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Fig. 14 
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Fig. 15 
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Fig. 18 

A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 
TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG GTT 

BspHI 



ATG GCA TCG GAT ATT AT 3' 
TAC CGT AGC CTA TAA TAT A 

Asel 



[SEQ ID NO: 7] 
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Fig. 19 

A) Synthetic adaptators 

PstI 5'GT GAC GAT GAC GAT AAG CAT CAT CAT CAT CAT CAT TAG 
ACGTC ACA CTG CTA CTG CTA TTC GTA GTA GTA GTA GTA GTA ATC 



GGATCCGCATG 3 1 

CCTAGGC SphI [SEQIDNO:8] 
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Fig. 20 



A) pNTV3475 

EcoRI Xbal 



AfUI BamHI 



Saccharomyces cerevisiae Ubiquitin 



X_pUC19 



ATG 



Gly 76 



B) Synthetic adaptators 

5' CT AGC ATG CAG ATC TTC GTC AAG ACG TTA ACC GGT AAA ACC 
Nhel G TAC GTC TAG AAG CAG TTC TGC AAT TGG CCA TTT TGG 



ATA ACC 3' Xbal 
TAT TGG ATCT 



[SEQ ID NO: 9] 
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Fig. 21 
A)pNIV4122 

Hindni 





aa 1-21 sTPA 
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aa 1-76 Ubiquitin 















Aflll BamHI Spel 

pBluescript 



ATG 
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B) Synthetic adaptators 

5' TTA AGA CTA AGA GAC GAT GAC GAT AAG TCC AGT CAA AAC 
Aflll CT GAT TCT CTG CTA CTG CTA TTC AGG TCA GTT TTG 

ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTC AGC 
TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG TCA CGT CAG TCG 

AAA GGC TAT CTT AGT GCT CTA AGA ACT GGT TGG TAT A3' Spel 
TTT CCG ATA GTT TCT CGA GAT TCT TGA CCA ACC ATA TGA TC 
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Fig. 22 
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A)pNTV4120 
BamHI 
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STOP 
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Fig. 24 



A) pNIV4120 



Spel 



BamHI 



F RSV 1-526 




HN PJV3 70-572 



_pUC19 



ATG 



STOP 



B) Synthetic adaptators 

5 'GAT CAA AAC ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC 
BamHI TT TTG TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG 

AGT GCA GTC AGC AAA GGC TAT CTT AGT GCT CTA AGA ACT 
TCA CGT CAG TCG TTT CCG ATA GAA TCA CGA GAT TCT TGA 

'GGT "TG'G ' TA"T "A 3**Spei / 

CCA ACC ATA TGA. TC [ SEQ ID NO : 1.1] 

C) pNIV4136 



BamHI Spel 
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Fig 25: SDS-PAGE (reduced conditions) of the F RSV HN PiV3 protein purified by 
immunoaffinity from the spent culture medium of the recombinant baculovirus 3546. 

kDa: molecular weight marker 
A; Coomassie blue staining 

B: Western blot revealed by a goat polyclonal anti-RSV serum 20 RG45 



kDa A B 
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Fig . 26: Codon usage of F RSV HN PiV3 and highly expressed human genes (hum high exp) 
showing frequencies (xlOO) of the individual codons for each of the degenerately encoded 
amino acids, and the most prevalent codon in bold. 
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Fig. 27: Schematic diagram of the PCR synthesis of each fragment showing unique 
restriction sites along the sequence (black dots) and restriction sites (A and B) that allow 
retrieval of the full size fragment from the cloning vector. 
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Fig 28: Sequence of the 18 oligonucleotides from which PCR fragment A was generated. 

1) olfhuml.seq, .bases 1-90 of F RS vHN PiV 3, homologous to mRNA 

5' cccTCTAGAGGATCCACCATGGAGCTGCTGATtttaAAGACCAACGCCATCACCGCCATCCTG 

GCCGCGGTGACCCTCTGCTTCGCGTCC 

2) olfhum2 . seq, bases 75-165 of F R svHN PiV 3, inverse complementary to 
mARN 

5 ' CCTCAGCGCGCTCAGGTAGCCCTTGCTGACAGCagaGCAGGTGGACTGGTAGAACTCCTCGGTG 
ATGTTCTGGCTGGACGCGAAGCAGAGG 

3) olfhum3.seq, bases 150-240 of FRsvHNp iV 3, homologous to mRNA 

5 ' CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAG 

GAGAACAAGTGCAACGGCACCGACGCC 

4) olfhum4.seq, bases 225-310 of F R svHN Pi v3, inverse complementary to 

mARN * v 

5 ' GCATCAGCAGCTGCAGCTCGGTCACGGCGCTCTTGTACTTGTCCAGCTCCTGCTTGATCAGCTT 

CACCTTGGCGTCGGTGCCGTTG 

5) olf hum5. seq, bases 295-397 of F RSV HN PiV 3, homologous to mRNA 

5 ' CTGCAGCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGT 

TCATGAACTACACCCTCAACAACACCAAGAACACCAACG 

6) olfhum6.seq, bases 37B-496 of F RSV HN P1V 3/ inverse complementary 
to mRNA 

GGTGCAGGACCTTGGACACCGCGATGCCGCTGGCGATGGCGGAGCCCACGCCCAGCAGGAAGCCCA 
GGAAgCGcctCTTgCgCTTCTTGCTCAGGGTCACGTTGGTGTTCTTGGTGTTG 

7) olfhum7.seq, bases 480-561 of F RSV HN P iv3/ homologous to mRNA 

5' GTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGATC7VAGAGCGCCCTGCTCTCCACCAAC 

AAGGCGGTGGTCAGCCTG 

8) olfhum8.seq, bases 543-633 of F R svHN Pi v3, inverse complementary 

S^GGGGAGCAatTGCTTGTCGATGTAGTTGTTGAGGTCCAGCACCTTGCTGGTCAGCACGCTCACG 
CCGTTGGACAGGCTGACCACCGCCTTG 

9) olfhum9. seq/" bases 609-676 of F R svHN Pi v3, homologous to mRNA 

5 ' CTAGATCGACAAGCAatTGCTCCCCATCGTGAAGAAGCAGtcCTGCAGCATCTCTAACATTGAG 

ACCG 

10) olfhumlO.-seq, bases 653-732" of F RS vHN Piv3 , inverse complementary 

to mARN . 
5/ GCTGAACTCCGTGGTGATCTCCAGCAGCCTGTTGTTCTTCTGCTGGAACTCGATCACGGTGTCA 

ATGTTAGAGATGCTGC 
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11) olfhumll seq, bases 714-787 of F RS vHN Pi v3, homologous to mRNA 
5'GATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTACATGCTGACC 

AACAGCGAGC 

12) olfhuml2.seq, bases 768-846 of F R svHN Pi v3. inverse complementary 

5°GTTGGACATaAGCTTCTTCTGGTCGTTGGTGATGGGCATGTCGTTGATCAGGGACAGCAGCTCG 
CTGTTGGTCAGCATG 

13) olfhuml3.seq, bases 825-916 of FrsvHN p j.v3, homologous to mRNA 

5' GCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagC 

ATCATCAAGGAGGAGGTGCTGGCCTACG 

14) olfhuml4 . seq, bases 900-990 of FRsvHNpiva, inverse complementary 

VGGTGGTGCACAGGGGGGAGGTGTGCAGCTTCCAGCAGGGGGTGTCGATCACGCCGTACAGGGGC 
AGCTGCACCACGTAGGCCAGCACCTCC 

15) olfhuml5.seq, bases 975-1065 of F^yHNpivs, homologous to mRNA 

5' CCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGC 

TGGTACTGCGACAACGCCGGCTCCGTG 

16) olfhuml6.seq, bases 1048-1133 of FrsvHNpivs, inverse 
complementary to mARN 

5 ' CTGTTCATGGTGTCGCAGAACACGCGGTTGGACTGCACCTTGCAGGTCTCCGCCAGGGGGAAGA 
AGGACACGGAGCCGGCGTTGTC 

17) olfhuml7.seq, bases 1116-1210 of FrsvHNp^, homologous to mRNA 
5'CTGCGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACATCGACATCTTC 

AACCCCAAGTACGACTGCAAGATtATGacctcc 

18) olfhuml8.seq, bases 1195-1295 of FusvHNpiva, inverse 
complementary to mARN 

gggaattctgtacacttggtcttgccgtagcaggacacgatggcgcccagggaggtgatcacggag 
.ctg J z.t.cac.at-C"gg.tcttgga.g.gtCATaATCTTGCAG 

[SEQUENCES ABOVE ARE SEQ ID NOs: 12 to 29, respectively] 
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Fig. 29: Construction of pEE14 F RSV humHNp iV 3 
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Fig, 30: Sequence of the 10 oligonucleotides fr m which PGR fragment B was generated. 

1} olf hnhuml9 . seq, bases 1269-1353 of F RS vHN PiV 3, homologous to mRNA 
5 1 cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaagaccttctccaacgg 
ctgcgact acgtgtccaacaag 3 1 

2) olf hnhum20 . seq, bases 1336-1428 of F R svHN PiV 3, inverse 
complementary of mARN 

5' cttcacgtacaggctcttgccctcctgcttgttcacgtagtacagggtgttgcccacggacac 
ggtgtccacgcccttgttggacacgtagtc 3 1 

3) olfhnhum21 . seq f bases 1413-1497 of F R svHN Pi v3, homologous to mARN 
5 T gagcctgtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcccctccga 
cgagttcgacgcctccat ctccc 3 ' 

4) olf hnhum22 . seq, basesl4 8 3-15 99 of F RS vHN PiV 3, inverse 
complementary of mARN 

5 ' gttcatgatgttggtggtggacttgccggcgttcacgttgtgcagcagctcgtcggacttgcg 
gatgaaggccaggctctggttgatcttctcgttcacctgggagatggaggcgtc 3 1 

5) olfhnhum23. seq, bases 1581-1691 of F RS vHN PiV 3/ homologous to mARN 
5 1 caccaccaacatcatgaacaacgagttcatggaggtgaccgagaagatccagatggcctccga 
caacatcaacgacctgatccagtccggcgtgaacacccggctgctgac 3 1 

6) olfhnhum24 . seq, bases 1677-1779 of F RSV HN PiV 3f inverse 
complementary of mARN 

5 ' gatggtgatctcgctgatgaacttccgcaggtcggacatctgctgggtcagggagatggggat 
gtagttctgcacgtggctctggatggtcagcagccgggtg 3 ? 

7 ) olfhnhum25 . seq, bases 1761-1865 of F R5V HN Pi v3, homologous to mRNA 
5 ' catcagcgagatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacccacga 
cgtgggcataaagcccctgaaccccgacgacttctggcgctg 3 * 

8) olf hnhum26. seq, bases 1849-1967 of F RS vHN PiV 3, inverse 
complementary -of mARN 

5 1 gtgcgcacgcagccgtccacggtggtgggcatggccagcaggccgggcccgggcatcagcctt 
atcttgggggtcttcatcagggaggggaggccggaggtgcagcgccagaagtcgtc 3 ' 

9) .olf-hnhum27 . seq, bases 1953-2059 of F RSV HNpiv3, homologous to mRNA 
5 Vcggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacgcctacacctccaacct 
gatcacccgcggctgccaggacatcggcaagtcctaccaggtgc 3 1 

10) olfhnhum28 . seq, bases 2043-2154 of FRsvHNpiv3, r inverse 
complementary of mARN 

5 * ggacttcctgttgtcgttgatgttgaaggtgtgggagatccgggggttcaggtcgggcaccag 
gtcggagttcacggtgatgatgccgatctgcagcacctggtag.gacttg 

[SEQUENCES ABOVE. ARE SEQ ID.NOs: 30—39, respectively] 
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Fig. 31: Sequence f the 16 oligonucleotides from which PCR fragment C was generated. 

1) olfhnum2 9.seq, bases 2139-2229 of F RS vHN PiV 3, homologous to mRNA 
5»cgacaacaggaagtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctccac 

gcccaaggtggacgagcgctccgactac 3' 

2) olfhnhum30.seq, bases 2214-2307 of F RS vHN PiV 3/ inverse 
complementary to mRNA 

5'gttcttgaagcgggtggtggagatggagccgtcgtggttgacgatgtccagcacgatgtcctc 
gatgccggagctggcgtagtcggagcgctcg 3 T 

3) olfhnhum31.seq, bases 2292-2398 of F RS vHN PiV 3, homologous to mRNA 
5' cacccgcttcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccctccgt 

gggccccggcatctactacaagggcaagatcatcttcctgggc 3 f 

4) olfhnhum32 . seq, bases 2382-2472 of F RS vHN PiV 3, inverse 
complementary to mRNA 

5' ccgctgggtcttgccggggcacccggtggtgttgcagatggcgttctcgttgatggggtgctc 
caggccgccgtagcccaggaagatgatc 3 1 

5>olfhnhum33.seq, bases 2457-2549 of F^vHNp^, homologous to mRNA 
5'cggcaagacccagcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgcat 

ggtgaactccatcatcgtggtggacaaggg 3 1 

6) olfhnhum34.seq, bases 2532-2643 of F RS vHNp iv3 , inverse 
complementary to mRNA 

5' cttgttgcccagcagcagcaggcggccctcggagccccagtagttctgccgcatggagatggt 
ccacaccttcagcttggggatggagttcaggcccttgtccaccacgatg 3 r 

7) olfhnhum35.seq, bases 2628-2726 of F RS vHN PiV 3, homologous to mRNA 
S'gctgctgggcaacaagatctacatctacacccgctccaccagctggcacagcaagctgcagct 

gggcatcatcgacatcaccgactacagcgacatccg 3 ' 

8) olf hnhum36. seq, bases 2710-2781 of FRsvHNpiva, inverse 
complementary to rrfRlCTA 

5'ggggcactcgttgttgccgggccggctaagcacg.ttgtgccaggtccacttgatgcggatgtc 
gctgtagtc 3 1 

9) olfhnhum37.seq, bases 2765-283.6 of Fj^vHNpivs *- homologous" to mRNA 
5' gcaacaacgagtgcccctggggccactcctgccccgacggGtgcatcaccggcgtgtacaccg 

acgcctacc 3' 

10) olfhnhum38.seq, bases ,2820-288 9 of F RSV HNpiv3, -inverse 
complementary : to mRNA 

5 ' cttctggg-agtccaggatcacggagctcacgatgct.gccggtggggttGagggggta.ggcgtc 
ggtgtac 
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11) olf hnhum39 . seq, bases 2874-2943 of F RSV HN Pi v3, homologous to mRNA 
5 • cctggactcccagaagtcccgggtgaaccccgtgatcacctacagcacctccaccgagcgcgt 

gaacgag 

12) olfhnhum40. seq, bases 2927-2994 of F RS vHN Pi v3f rom: 1 to: 68, 
inverse complementary to mRNA 

5' gcagctggtggtggtgtagccggcgctcagggtcttgttgcggatggccagctcgttcacgcg 
ctcgg 

13) olfhnhum41.seq, bases 2979-3043 of F RS vHN PiV 3, homologous to mRNA 
5' caccaccaccagctgcatcacccactacaacaagggctactgcttccacatcgtggagatcaa 

cc 

14 ) olf hnhum42 . se'q, bases 3027-3085 of F n svHN PiV 3, inverse 
complementary to mRNA 

5 ' cggtcttgaacagcatgggctggaaggtgtccaggctcttgtggttgatctccacgatg 3 ' 

15) olfhnhum4 3 . seq, bases 3069-3114 of F RS vHN PiV 3/ homologous to mRNA 
5' catgctgttcaagaccgagatccccaagagctgcagctaaGAATTC 3 ' 

[SEQUENCES ABOVE ARE SEQ ID NOs : 40-54, respectively] - 



DEC 04 2000 18:50 



PAGE. 88 



WO 00/18929 



PCT/EP99/07004 



34/73 

Fig, 32 : Construction of pEE14F RSV hum HN PiV jbum 
a) pNIV4120 +PCR fragment A 

BsrGI 

XbaJ 



EcoRl 



F RSV 1265-1578XHN fSV1 pUC19 



b) PCR fragment B 



BsrGI 



Kpnl 



Frsv HN Pl „ 1 264-2 1 3 6hum 



c) PCR fragment C 



Kpnl 



EcoRI 



IF.RSvHNpiV32.l36- 
3090hum 



d) pEE14 F RSV hum HN piV3 hum 



Xbal 



EcoRI 



FRsvhumHNp.vahum 



pEE14 
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Fig . 33A : Humanized nucleic acids sequence of F KSV HN PiV3 (upper sequence) compared to 
the original sequence found in the pNIV4120. 

7 AGAGGATCC . . ACCATGGAGCTGCTGATtttaAAGACCAACGCCA 4 9 

I I I I I I II I I I I I I I I I I I I I I I I III I I ol11 

2262 AGAGGATCCCCCGGGTAccatggagttgctaatcctcaaaacaaatgcaa 2311 

50 TCACCGCCATCCTGGCCGCGGTGACCCTCTGCTTCGCGTCCAGCCAGAAC 99 

I | | I I II II N M I I I I I N N " 

2312 ttaccgcaatccttgctgcagtcacactctgttttgcttccagtcaaaac 2361 

100 ATCACCGAGGAGTTCTACCAGTCCACCTGCtctGCTGTCAGCAAGGGCTA 14 9 

II II I I I I I I I I I I 

2362 atcactgaagaattttatcaatcaacatgcagtgcagtcagcaaaggcta 2411 

150 CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGC 199 

|| || II II II II II I I I I I M M II 'I 

2412 tcttagtgctctaagaactggttggtatactagtgttataactatagaat 2461 

. • « " 

200 TGAGCAACATCAAGGAGAACAAGTGCAACGGCACCGACGCCAAGGTGAAG 24 9 

I || II I I I I I I I I M I M I I II II N I I I I I I I I I I II 
24 62 taagtaatatcaaggaaaataagtgtaatggaacagacgctaaggtaaaa 2511 

250 CTGATCAAGCAGGAGCTGGACAAGTACAAGAGCGCCGTGACCGAGCTGCA 299 

| | |.| | | | | | I I II II M II II II II 'I 'I I I I I 
2512 ttgataaaacaagaattagataaatataaaagtgctgtaacagaattgca 2561 

300 GCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGC 34 9 

I I I I I I I I I I I I I I I I I II I 

2562 gttgctcatgcaaagcacaccggcaaccaacaatcgagccagaagagaac 2611 

m 

350 TGCCCAGGTTCATGAACTACACCCTCAACAACACCAAGAACACCAACGTG 3 99 

I II Mill I I I I I M M I I I I I I II I I I I I I I I I I I I I I 

25 1 2 "•tra-c'c-a- a -g g L L • c gt-geratrvo'c a" a-c tea ac a-et-E c-sea-ta & atee&a-a t-qt-a. -2 6-^, 

400 ACCCTGAGCAAGAAGcGcAAGaggCGcTTCCTGGGCTTCCTGCTGGGCGT 44 9 

II I I I I I I I I I 1 II II I I I I I I I I I I II I M M 
2662 acattaagcaagaaaaggaaaagaagatttcttggetttttgttaggtgt 2711 

4 50 GGGCTCCGCC-ATCGCCAGCGGCATCGCGGTGTCCAAGGTCCTGCAGCTGG 499 

II ||. II II I I I M I I I I I I II M M I I I I II I I I I I I I I I 
2712 tggatctgcaatcgcGagtggcattgctgtatGtaaggtcctgcacctag 27 61 

5 00 AGGGGGAGGTGAACAAGATGAAGAGCGCCCTGCTCTCCACCAACAAGGCG 54 9 

I I I I 1 I I I I MM I Mill M II M II I M II I I M II M 
2762 aaggggaagtgaacaaaatcaaaagtgctctactatccacaaacaaggct 2811 
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550 GTGGTCAGCCTGTCCAACGGCGTGAGCGTGCTGACCAGCAAGGTGCTGGA 5 99 

|| || II I I I II I I I III I M 

2812 gtagtcagcttatcaaatggagttagtgtcttaaccagcaaagtgttaga 2861 

600 CCTCAAGAACTACATCGACAAGCAatTGCTCCCCATCGTGAACAAGCAGt 64 9 

| | | M | Mill II II II I M II I I I I I I I I I I I 

2862 cctcaaaaactatatagataaacagttgttacctattgtgaacaagcaaa 2911 

. > • 

650 cCTGCAGCATCTCTAACATTGAGACCGTGATCGAGTTCCAGCAGAAGAAC 699 

Ml | | | M | I I I I I I I I I I I I I 

2912 gctgtagcatatcaaacattgaaactgtgatagagttccaacaaaagaac 2961 

7 00 AACAGGCTGCTGGAGATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcAC 74 9 

Mill M II I I I I I Ill II M II II II M M M 

2962 aacagactactagagattaccagggaatttagtgttaatgcaggtgtaac 3011 

m 

7 50 CACCCCGGTGAGCACCTACATGCTGACCAACAGCGAGCTGCTGTCCCTGA 7 99 
I I I I II M II I I 



3012 



3061 



8 00 TCAACGACATGCCCATCACCAACGACCAGAAGAAGCTtATGTCCAACAAC 849 

| | || || | | | II II II II M I I M I III I I I II I M I I I I 
3062 tcaatgatatgcctataacaaatgatcagaaaaagttaatgtccaacaat 3111 

8 50 GTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagCATCATCAAGGA 8 99 

I I | | | | M I Mill M Ml I I II I I I I I I I I I I I I 
3112 gttcaaatagttagacagcaaagttactctatcatgtccataataaagga 3161 



900 GGAGGTGCTGGCCTACGTGGTGCAGCTGCCCCTGTACGGCGTGATCGACA 94 9 

I I I II I II I I M I I I 'I I' ' ' ' 

3162 ggaagtcttagcatatgtagtacaattaccactatatggtgtaatagata 3211 

950 CCCCCTGCTGGAAGCTGCACACCTCCCCCCTGTGCACCACCAACACCAAG 9 99 

-i ■ •! -! • -I "! "1 1 : M '! • •! -! i •! •!••>! -!-! ••! •!••''• I • *' ■! • >! -! -I * J - ^ ; ' -I " ! ■ " ^ *' 
3212 caccttgttggaaactgcacacatcccctGtatgtacaaccaacacaaag 3261 

1000 GAGGGCTCCAACATCTGCCTGACCGGCACCGACCGGGGCTGGTACTGCGA 104 9 

M 1111111111111 I II I I II I 11 I II I I I I I I I I II 
. 32 62 gaagggtccaacatctgtttaacaagaaccgacagaggatggtactgtga 3311 

...» 

1050 CAACGCCGGCTCCGTGTCGTTCTTCCCCCTGGCGGAGACCTGCAAGGTGC 1 0'9 9 

I I I || I .I .11 M M I I I 1 .1 I ! I II I I I I II II ' I I ' I 
3312 caatgcaggatcagtatctttcttcccaGtagctgaaacatgtaaagttc 3 361 

1 100 AGTCCAACCGCGTGTTCTGCGACACCATGAACAGCCTGACCCTGCCCAGC 114 9 
| M II I I II II II . II I I I M II I I M IN I II ii 
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3362 aatcgaatcgagtattttgtgacacaatgaacagtttaacattaccaagt 3411 

1150 GAGGTGAACCTCTGCAACATCGACATCTTCAACCCCAAGTACGACTGCAA 1199 

|| || | I I I I II I I I I I I I I I I I I I I M II I I I I I 

3412 gaagtaaatctctgcaacattgacatattcaaccccaaatatgattgcaa 3461 

1200 GATtATGacctccaagaccgacgtgagcagctccgtgatcacctccctgg 124 9 

| | | | I I I I II II II II II I I I I I I I I I I I II I 

3462 aattatgacttcaaaaacagatgtaagcagctccgttatcacatctctag 3511 

1250 gcgccatcgtgtcctgctacggcaagaccaagtgtacagcctccaacaag 1299 

| | | | M | I I I I I I I I I I I I I I II II I I I I I I I I 

3512 gagccattgtgtcatgctatggcaaaactaaatgtacagcatccaataaa 3561 

9 . • • 

1300 aaccgcggcatcatcaagaccttctccaacggctgcgactacgtgtccaa 1349 

M | | | | I I II I I I I I I II M Mill II II II M M II 
3 562 aatcgtggaatcataaagacattttctaacgggtgtgattatgtatcaaa 3611 

1350 caagggcgtggacaccgtgtccgtgggcaacaccctgtactacgtgaaca 1399 

| | | I I I I I I I I I I M I M M M II II I II 'I ' 1 I 1 I 
3 612 taagggggtggacactgtgtctgtaggtaatacattatattatgtaaata 3 661 

* 

1400 agcaggagggcaagagcctgtacgtgaagggcgagcccatcatcaacttc 1449 

MM M Mill II M M II M M II II II M II Ml 
3 662 agcaagaaggcaaaagtctctatgtaaaaggtgaaccaataataaatttc 3711 



14 50 tacgacccgctggtgttcccctccgacgagttcgacgcctccatctccca 14 99 

M || M I I II II M I II I I M M I I I I II I I I I I I I I 
3712 tai 



3761 



1500 ggtgaacgagaagatcaaccagagcctggccttcatccgcaagtccgacg 1549 

|| || I I I I II I I II I M I I M I I M II N II I' I I I I I I 
3762 agtcaatgagaagattaaccagagcctagcatttattcgtaaatccgatg 3811 

15 50 agctgctgcacaacgtgaacgccggcaagtccaccaccaacatcatgaBC IS*3S 

I I I II II II M M I I II II M I I M I I I I M M I II 
3812 aattattacataatgtaaatgctggtaaatccacGacaaatatcatgAAC 38 61 

1600 aacgagttcatggaggtgaccgagaagatccagatggcctccgacaacat 164 9 

|| Mill I I M I II I I I I I II I I I II I II I I M M II II 
3862 AATGAGTTTATGGAAGTTACAGAAAAGATCCAAATGGCATCGGATAATAT 3911 

1650 caacgacctgatccagtccggcgtgaacacccggctgctgaGcatccaga 1699 

M II M M I II II M I I I M M II M I I M M I 1 I I 
3912 TAATGATCTAATACAGTCAGGAGTGAATACAAGGCTTCTTACAATTCAGA 3 961 
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1700 accacqtqcagaactacatccccatctccctgacccagcagatgtccgac 1749 

I t| || Hill II I II I I I I I I I I I I I I I I I 

3 962 GTCATGTCCAGAATTATATACCaATATCATTGACACAACAAATGTCGGAT 4 011 

1750 ctqcggaagttcatcagcgagatcaccatccggaacgacaaccaggaggt 1799 

I I I I I I | I I II I I I I I I M 

CTTAGGAAATTCATTAGTGAAATTACAATTAGGAATGATAATCAAGAAGT 



4012 
1800 
4062 



4061 



qcccccccagaggatcacccacgacgtgggcataaagcccctgaaccccg 18 4 9 

Ml II M II H M I I I I I I I I I M I II M I 

GCCTCCACAAAGAATAACACATGATGTGGGCATAAAACCTTTAAATCCAG 



4111 



■ • 

1850 acgacttctggcgctgcacctccggcctcccctccctgatgaagaccccc 18 99 

II I I I I I I I I I I I I I I I I 

4112 ATGATTTTTGGAGATGCACGTCTGGTCTTCCATCTTTAATGAAAACTCCA 4161 



1900 aagataaggctgatgcccgggcccggcctgctggccatgcccaccaccgt 1949 

I I I M I I I I I I I I I I I I I I I I I I I I I I I 

AAAATAAGGTTAATGCCGGGGCCGGGATTATTAGCTATGCCAACGACTGT 



4162 



4211 



1950 qqacggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacg 1999 

II Mill II I M M Ml I II N II M II I 

4 212 TGATGGCTGTGTTAGAACTCCGTCCTTAGTTATAAATGATCTGATTTATG 42 61 

» • * 

2000 cctacacctccaacctgatcacccgcggctgccaggacatcggcaagtcc 2049 

i ii 1 1 1 1 1 ii ii ii ii ii H i m 1 1 1 1 1 ii nil M 

4 262 CTTATACCTCGAATCTAATTACTCGAGGTTGCCAGGATATAGGAAAATCA 4311 

2050 taccaggtgctqcagatcggcatcatcaccgtgaactccgacctggtacc 2099 

M ! I I I I I I I I I ! I I I I I M I I M M I I i I I I I I I I ) 
4 312 TATCAAGTATTACAGATAGGGATAATAACTGTAAACTCAGACTTGGTACC 4361 

2100 cgacctgaacccccggatctcccacaccttcaacatcaacgacaacagga 2149 

4 3.62 TGACTTAAATCCTAGGATCTCTGATACTTTCAACATAAATGACAATAGAA 4411 

2150 agtcctgctccctggccctc.ctgaacaccgacgtgtaccagctgtgctcc 2199 

I M I II II II II I I I I I I I I I I I I I I I I I I II H I I I aa ^ 
4 412 AGTCATGTTCTCTAGCACTCCTAAACACAGATGTATATCAACTGTGTTCG 4 4 61 

. * * 

22.00 ■acgcccaaggtggacgagcgGtccgactacgc.cagc.tceggcatcgagga 224 9 

H M Mi ll II II J II 11.11 II I I I I I I I I \ I I 

44 62 ACTCCCAAAGTTGATGAAAGATCAGATTATGCATCATCAGGCATAGAAGA 4511 

* • • • - " 

2250 catcgtgctggacatcgtcaaccacgaeggctccatctccaccacccgct 2299 

II II II II II I III I II M M I I I I I M I I I I II 
4 512 TATTGTACTTGATATtGTCAATCATGATGGTTCAATCTCAACAACAAGAT 4 5 61 
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2300 tcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccc 2349 

| | | | I I I I I II II I M M I I II II I I I I I I 

4 5 62 TTAAGAACAATAATATAAGTTTTGATCAACCATATGCGGCATTATACCCA 4 611 

2 3 50 tccgtgggccccggcatctactacaagggcaagatcatcttcctgggcta 2 39 9 

M II II M I I M I I I I I I I I I I I I I II II II II II II 
4 612 TCTGTTGG ACCAGGGATATACTACAAAGGCAA/^ATAATATTTCTCGGGTA 4 661 

2 4 00 cggcggcctggagcaccccatcaacgagaacgccatctgcaacaccaccg 24 4 9 

|| || II M || II II II I I I I I II I I I I I I I I I I I II I 
4 662 TGGAGGTCTTGAACATCCAATAAATGAGAATGCAATCTGCAACACAACTG 4 711 

2 4 50 ggtgccccggcaagacccagcgggactgcaaccaggcctcccacagcccc 2 4 99 

I I I I Mill II I I I M I I M I I II I I I I I I II II M II 
4 712 GGTGTCCCGGGAAAACGCAGAGAGACTGCAATCAGGCATCTCATAGTCCT 4 7 61 

2 500 tggttctccgaccgccgcatggtgaactccatcatcgtggtggacaaggg 2 54 9 

Mill II III I I II II I I I II II M II II II II I II M I 
4 7 62 TGGTTTTCAGACAGAAGGATGGTCAACTCCATTATTGTTGTTGACAAGGG 4 811 

• • * 

2 5 50 cctgaactccatccccaagctgaaggtgtggaccatctccatgcggcaga 25 99 

| | | | | || M II II I II I I I I I II I I I II I I M II I II I 
4 812 CTTAAACTCAATTCCAAAACTGAAGGTATGGACGATATCCATGAGACAAA 4 8 61 

2 600 actactggggctccgagggccgcctgctgctgctgggcaacaagatctac 2 64 9 

I II I I I I II M I I II I II II II II II I I II I II I I I I 

48 62 ATTACTGGGGGTCAGAAGGAAGGCTACTTCTACTAGGTAACAAGATCTAT 4911 

2 650 atctacacccgctccaccagctggcacagcaagctgcagctgggcatcat 2699 

M I I- I I I M I I II II I I I I M I M I I I I II I I M 
4 912 ATATATACAAGATCTACAAGTTGGCATAGCAAGTTACAATTAGGAATAAT 4 961 

2 7 00 cgacatcaGcgactacagcgacatccgcatcaagtggacctggcacaacg 27 4 9 

I I i 1 n Vi 11111 11 "1 1 1 1 1 11 1111 1 11 1 "i 1 i i i 
4 9 62 TGATATTACTGATTACAGTGATATAAGAATAAAATGGACATGGCATAATG 5011 

• - • * * 

2 7 50 tgctgagGcggcccggcaacaacgagtgcccctggggccactcctgcccc 27 99 

MM I II II I I II I " M M M I I II I M II I I I I I 

5012. TGCTATCAAGACCAGGAAACAATGAATGTCCATGGGGACATTCATGCCCA 5061 

28 00 gacggctgcatcaccggcgtgtacaccgacgcctaccccctgaaccccac 284 9 

II II II II I I I I M II II II II II M M M I 'M I I 

50 62 GATGGATGTATAACAGGAGTATATACTGATGGATATCCACTCAATCCCAC 5111 

. - - • * 

2 8 50 cggcagcatcgtgagctccgtgatcctggactcccagaagtcccgggtga 28 99 

I I I I I II I I I I I I I I I I II I I I I I I I I I INI 
5112 AGGGAGGATTGTGTCATCTGTCATATTAGACTCGCAAAAATCGAGAGTAA 5161 
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, . • 

2900 accccgtgatcacctacagcacctccaccgagcgcgtgaacgagctggcc 2949 

DM || II II III I I I I I I I I I I I I I I I I I I I I I I I 
5162 ACCCAGTCATAACTTACTCAACATCAACTGAAAGGGTAAACGAGCTGGCC 5211 

2950 atccgcaacaagaccctgagcgccggctacaccaccaccagctgcatcac 2999 

, I I ,| I | I I | | | | | | | I I I I I I I I I I M I I I I I I M 

5212 ATCCGAAACAAAACACTCTCAGCTGGATATACAACAACGAGCTGCATTAC 5261 

3000 ccactacaacaagggctactgcttccacatcgtggagatcaaccacaaga 3049 

I I I I I I I I t I II M M II M M II II I I M H II I conn 
5262 ACACTATAACAAAGGATATTGTTTTCATATAGTAGAAATAAATCATAAAA 5311 

3050 qcctggacaccttccagcccatgctgttcaagaccgagatccccaagagc 3099 

|| ill | | | I I I I I II Ml I I I I I I I I I I I I I I N 'I HI 
5312 GCTTAGACACATTCCAACCTATGTTGTTCAAAACAGAGATTCCAAAAAGC 5361 

3100 tgcagctaaGAAT 3112 

I I I I I II I II 
5362 TGCAGTTAATCAT 5374 
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Fig. 33B 



(Linear) MAP of: Fr svHNpiv3 . seq check: 7448 from: 1 to: 
3090 nucleic acicis sequence of FrsvHNpiv3 (non humanised) 



atggagttgctaatcctcaaaacaaatgcaattaccgcaatccttgctgc 
agtcacactctgttttgcttccagtcaaaacatcactgaagaattttatc 
aatcaacatgcagtgcagtcagcaaaggctatcttagtgctctaagaact 
ggttggtatactagtgttataactatagaattaagtaatatcaaggaaaa 
taagtgtaatggaacagacgctaaggtaaaattgataaaacaagaattag 
ataaatataaaagtgctgtaacagaattgcagttgctcatgcaaagcaca 
ccggcaaccaacaatcgagccagaagagaactaccaaggtttatgaatta 
tacactcaacaataccaaaaataccaatgtaacattaagcaagaaaagga 
aaagaagatttcttggctttttgttaggtgttggatctgcaatcgccagt 
ggcattgctgtatctaaggtcctgcacctagaaggggaagtgaacaaaat 
caaaagtgctctactatccacaaacaaggctgtagtcagcttatcaaatg 
gagttagtgtcttaaccagcaaagtgttagacctcaaaaactatatagat 
aaacagttgttacctattgtgaacaagcaaagctgtagcatatcaaacat 
tgaaactgtgatagagttccaacaaaagaacaacagactactagagatta 
ccagggaatttagtgttaatgcaggtgtaaGtacacctgtaagcacttat 
atgttaacaaatagtgaattattatcattaatcaatgatatgcctataac 
aaatgatcagaaaaagttaatgtccaacaatgttcaaatagttagacagc 
aaagttactctatcatgtccataataaaggaggaagtcttagcatatgta 
gtacaattaccactatatggtgtaatagatacaccttgttggaaactgca 
cacatcccctctatgtacaacGaacacaaaggaagggtcGaacatctgtt 
taacaagaaccgacagaggatggtactgtgacaatgcaggatcagtatct 
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ttcttcccactagctgaaacatgtaaagttcaatcgaatcgagtattttg 
tgacacaatgaacagtttaacattaccaagtgaagtaaatctctgcaaca 
ttgacatattcaaccccaaatatgattgcaaaattatgacttcaaaaaca 
gatgtaagcagctccgttatcacatctctaggagccattgtgtcatgcta 
tggcaaaactaaatgtacagcatccaataaaaatcgtggaatcataaaga 
cattttctaacgggtgtgattatgtatcaaataagggggtggacactgtg 
tctgtaggtaatacattatattatgtaaataagcaagaaggcaaaagtct 
ctatgtaaaaggtgaaccaataataaatttctatgacccattagtgttcc 
cctctgatgaatttgatgcatcaatatctcaagtcaatgagaagattaac 
cagagcctagcatttattcgtaaatccgatgaattattacataatgtaaa 
tgctggtaaatccaccacaaatatcatgAACAATGAGTTTATGGAAGTTA 
CAGAAAAGATCCAAATGGCATCGGATAATATTAATGATCTAATACAGTCA 
GGAGTGAATACAAGGCTTCTTACAATTCAGAGTCATGTCCAGAATTATAT 
ACCaATATCATTGACACAACAAATGTCGGATCTTAGGAAATTCATTAGTG 
AAATTACAATTAGGAATGATAATCAAGAAGTGCCTCCACAAAGAATAACA 
CATGATGTGGGCATAAAACCTTTAAATCCAGATGATTTTTGGAGATGCAC 
GTCTGGTCTTCCATCTTTAATGAAAACTCCAAAAATAAGGTTAATGCCGG 

G G C C G G G AT T ATTAGUTAT GUC AA^AUTGTTGATGGCT GTGTTAGAACT 
GCGTCCTTAGTTATAAATGATCTGATTTATGCTTATACCTCaAATCTAAT 
TACTGGAGGTTGCCAGGATATAGGAAAATCATATCAAGTATTACAGATAG 
GGATAATAACTGTAAACTCAGACTTGGTACGTGACTTAAATCGTAGGATC 
TCTCATACTTTCAACATAAATGACAATAGAAAGTCATGTTCTCTAGCACT 
CCTAAAtACAGATGTATATGAACTGTGTTCGACTCGGAAAGTTGATGAAA 
GATCAGATTATGCATCATCAGGCATAGAAGATATTGTACTTGATATtGTC 
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AATCATGATGGTTCAATCTCT^CAACAAGATTTAAGAACAATAATATT^AG 
TTTTGATCAACCATATGCGGCATTATACCCATCTGTTGGACCAGGGATAT 
ACTACAAAGGCAAAATAATATTTCTCGGGTATGGAGGTCTTGAACATCCA 
ATAAATGAGAATGCAATCTGCAACACAACTGGGTGTCCCGGGAAAACGCA 
GAGAGACTGCAATCAGGCATCTCATAGTCCcTGGTTTTCAGACAGAAGGA 
TGGTCAACTCCATTATTGTTGTTGACAAGGGCTTAAACTCAATTCCAAAA 
CTGAAGGTATGGACGATATCCATGAGACAAAATTACTGGGGGTCAGAAGG 
AAGGCTACTTCTACTAGGTT^ACAAGATCTATATATATACAAGATCTACAA 
GTTGGCATAGCAAGTTACAATTAGGAATAATTGATATTACTGATTACAGT 
GATATAAGAATAAAATGGACATGGCATAATGTGtTATCAAGACCAGGAAA 
CAATGAATGTCCATGGGGACATTCATGtCCAGATGGATGTATAACAGGAG 
TATATACTGATGCATATCCgCTCAATCCCACAGGGAGCATTGTGTCATCT 
GTCATATTAGACTCGCAAAAATCGAGAGTAAACCCAGTCATAACTTACTC 
AACAtCAACTGAAAGGGTAAACGAGCTGGCCATCCGAAACAAAACACTCT 
CAGCTGGATATACAACAACGAGCTGCATTACACACTATAACAAAGGATAT 
TGTTTTCATATAGTAGAAATAAATCATAAAAGCTTAGACACATTCCAACC 
TATGTTGTTCAAAACAGAGATTCCAAAAAGCTGCAGTTAA 
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Fig. ^ 33C 

(Linear) MAP of: FrhumHNphum . seq check: 9920 from: 1 to: 
3090 Humanised nucleic acids sequence of FRSVHNPiV3 



GGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAGGAGAA 
CAAGTGCAACGGCACCGACGCCAAGGTGAAGCTGATCAAGCAGGAGCTGG 
ACAAGTACAAGAGCGCCGTGACCGAGCTGCAGCTGCTGATGCAGAGCACC 
CCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGTTCATGAACTA 
CACCCTCAACAACACCAAGAACACCAACGTGACCCTGAGCAAGAAGcGcA 
AGaggCGcTTCCTGGGCTTCCTGCTGGGCGTGGGCTCCGCCATCGCCAGC 
GGCATCGCGGTGTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGAT 
CAAGAGCGCCCTGCTCTCCACCAACAAGGCGGTGGTCAGCCTGTCCAACG 
GCGTGAGCGTGCTGACCAGCAAGGTGCTGGACCTCAAGAACTACATCGAC 
AAGCAatTGCTCCCCATCGTGAACAAGCAGtcCTGCAGCATCTCTAACAT 
TGAGACCGTGATCGAGTTCCAGCAGAAGAACAACAGGCTGCTGGAGATCA 
CCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTAC 
ATGCTGACCAACAGCGAGCTGCTGTCCCTGATCAACGACATGCCCATCAC 

AGAGCTACagCATCATGagCATCATCAAGGAGGAGGTGGTGGCCTAGGTG 
GTGCAGC.TGCCCCTGTACGGCGTGATCGACACCCCCTGCTGGAAGCTGCA 
CAGCTCCCCCCTGTGCACCACCAACACCAAGGAGGGG-TCCAACATCTGCC 
TGACCCGCACCGACCGGGGCTGGTACT.GCGACAACGCCGGCTCCGTGTCC 
TTGTTGCCCCTGGCGGAGACCTGCAAGGTGCAGTCCAACCGCGTGTTCTG 
CGACACCATGAAGAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGGAACA 
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TCGACATCTTCAACCCCAAGTACGACTGCAAGATtATGacctccaagacc 
gacgtgagcagctccgtgatcacctccctgggcgccatcgtgtcctgcta 
cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaaga 
ccttctccaacggctgcgactacgtgtccaacaagggcgtggacaccgtg 
tccgtgggcaacaccctgtactacgtgaacaagcaggagggcaagagcct 
gtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcc 
cctccgacgagttcgacgcctccatctcccaggtgaacgagaagatcaac 
cagagcctggcct teat ccgcaagtccgacgagct get gcacaacgtgaa 
cgccgccaagtccaccaccaacatcatgaacaacgagttcatggaggtga 
ccgagaagatccagatggcctccgacaacatcaacgacctgatccagtcc 
ggcgtgaacacccggctgctgaccatccagagccacgtgcagaactacat 
ccccatctccctgacccagcagatgtccgacctgcggaagttcatcagcg 
agatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacc 
cacgacgtgggcataaagcccctgaaccccgacgacttctggcgctgcac 
ctccggcctcccctccctgatgaagacccccaagataaggctgatgcccg 
ggcccggcctgctggccatgcccaccaccgtggacggctgcgtgcgcacc 
ccctccctggtgatcaacgacctgatctacgcctacacctccaacctgat 

e acccgcg'gt: tg-cca qxp&o art cg^g e aagt'cc t -a - c o asg g*t -g c t*g co-gs t-c-g 
gcatcatcaccgtgaactccgacctggtacccgacctgaacccccggatc 
tcccacaecttcaacatcaacgacaacaggaagtcctgctccctggccct 
cctgaacaccgacgtgtaccagctgtgctccacgcccaaggtggacgagc 
gctccgactaGgccagctccggcatcgaggacatcgtgctggacatcgtc 
aaceacgacggctccatctccaccacccgcttcaagaacaacaacatcag 
cttcgaccagccctacgccgccctgtacccctccgtgggcccGggcatct 
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actacaagggcaagatcatcttcctgggctacggcggcctggagcacccc 
atcaacgagaacgccatctgcaacaccaccgggtgccccggcaagaccca 
gcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgca 
tggtgaactccatcatcgtggtggacaagggcctgaactccatccccaag 
ctgaaggtgtggaccatctccatgcggcagaactactggggctccgaggg 
ccgcctgctgctgctgggcaacaagatctacatctacacccgctccacca 
gctggcacagcaagctgcagctgggcatcatcgacatcaccgactacagc 
gacatccgcatcaagtggacctggcacaacgtgctgagccggcccggcaa 
caacgagtgcccctggggccactcctgccccgacggctgcatcaccggcg 
tgtacaccgacgcctaccccctgaaccccaccggcagcatcgtgagctcc 
gtgatcctggactcccagaagtcccgggtgaaccccgtgatcacctacag 
cacctccaccgagcgcgtgaacgagctggccatccgcaacaagaccctga 
gcgccggctacaccaccaccagctgcatcacccactacaacaagggctac 



tgcttccacatcgtggagatcaaccacaagagcctggacaccttccagcc 
catgctgttcaagaccgagatccccaagagctgcagctaa 

[SEQ ID NO : 56] 
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Fig. 34 A 
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Fig. 34B: Humanization impact on the level of expression of F RSV HN PiV 3 
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Fig 35: Codon usage of F MuV H Mv and highly expressed human genes (hum high exp) 
The frequencies (xlOO) of the individual codons are shown for each of the 
degenerately encoded amino acids, and the most prevalent codon is shown in bold. 
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Fi E 36: Schematic diagram of the PCR synthesis of each fragment in which X and 
Y Ire restriction sites that allow retrieval of the full size fragment from the cloning 

vector. 
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Fig 37 : Sequence of the 12 oligonucleotides from which PCR fragment A was 
generated. 



Doli 1 FmuvHmv 1-98,110111 ARN 

qqtctagaccaccATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAG 

+ + + + + + 60 

C AGC AT C T GC GT G AAC ATC AAC AT CC T GC AGC AG AT C G 
+ 4- +' 98 



2) oli 2 FmuvHmv 82-181, inv comp ARN 

GTTGGGCAGCAGCTTGACCACCACGTAGGAGCTGGAGCTCTGGGAGTAGTAGCTCAGCTG 

+ + 4- + + + 60 

CCTCACCTGCTGCTTGATGTATCCGATCTGCTGCAGGATG 
+ + 4- + 100 

3) oli 3 FmuvHmv, 166-2 64 horn ARN 

CAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAAGAGCGTGACCCA 

+ 4- + + + + 60 

GTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCGC 
+ + 4- 99 



4) oli 4 FmuvHmv, 250-352, inv compARN 

CAGGGCGGCGATGCCGATGGCGATGCCGGCGAACCGCTTGTGCCGCCGGGAGCCGGGGGA 

+ + 4- + 4 + 60 

GGGGGAGGTGATGTTGTTGATGTTCTCGGCGATGGGCAGCAGC 
+ + 4- + 103 



5) oli 5 FmuvHmv, 338-441, horn ARN 

GGCATCGCCGCCCTGGGCGTGGCCACCGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTG 

+ 4- 4- + + + 60 

CAGGCCCAGACCAACGCCCGCGCCATCGCCGCCATGAAGAACTC 
. 4. 4- + TlT4 



6) oli 6 FMuvHmv, 427-523, inv comp ARN 

GTCCTGGATGd^CTGCAeGGCGATGGCCAGCTGCTGGGTGCCCTCCTTCACCTCGAACAC 

+ + +_ 4- + 60 

GGCGCGGTTGGT-GGCCTGGATGGAGTTCTTCATGGCG 
4- + ~ 97 



7) oli 7 EMUVHM 509-610, horn ARN 

CAGGCCATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGTCCTGC 



4- 60 



CAGATGCTGGACAACCAGCTGGGCACCTCCCTGGGGCTGTAC 
+ + -4— 102 
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CACGGTGGTCAGCTCGGTCAGGTACAGGCCCAGGGAG 
+ + + y ' 

ACCTCCATCAGCGCCGCCGAGATCCTGAGCGCCGGCCTGATG 

+ + + + 

^ggac.gggxc^gxggg^ 



- 4— 



CAGCACGGACACGATCTGGCCCTCCATCAGGCCGGCGCTC 

-4- + + 



CAGGAGTCCATCATCCAGCTGCCCGACCGCATCCTGGAGATC 

+ + 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGA 

CTTGGCGGGGTAGCGCCACT 

+ + + + " XUJ 

[SEQ ID NOS: 57-68 respectively] 
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Fig. 38 : Sequence of the 9 oligonucleotides from which PCR fragment B was 

generated. 

i?i nli 12 FMUHM, 935-1039, inv compARN 
GCTCAG^COT 

+ + + + + + bU 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 

+ + + + 10b 



GCCGAGCGGCTGAGC^ 

_ + + 



-+ + 60 



TTCTCCAGCATCGCCGGCAGCTACATGCGCCGCTTCGTGGCCCTG 
+ + A h 10b 

i *n n ^ 14 1115-1216. inv comp ARN 

GGCGTGGTGGTCGGGCTGGTAGATGGGGTAGGAGGGGCTCTTGCACAGGCAGGTCAGGCT 

+ + + + + 

GCGGCAGTTGGCCACGATGGTGCCGTCCAGGGCCACGAAGCG 

+ + + 102 

CCCGACCAckcGCCGTGAC^cScG^CCTGACCTCCTGC 

+ + + + + + bU 

GGCCTGGACTTCAGCATCGTGTCCCTGAGCAACATCAC 

+ + + 98 

ni i "I 6 1?8S-1387, inv comp ARN 

c^gScagctcggtggaga^gtcgatgggctgggtgttgatggtctggctcaggctgat 

+ + + + + + bu 

GGTCAGGTTCTCGGCGTAGGTGATGTTGCTCAGG 

+ + + 94 



1*7 1 nli 17 1363-1462, hom ARN 

CACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACATCAAGGAGAG 

1. — 

caaccaccagctgcagagcgtgagcgtgagcagcaagcgc 

_ + + + 100 



+ - + — +- + + 60 



ip> 0 i-i io 1447-1550, inv comp ARN 

tcacctggtgctcgatggagttggtcacgtccaggttggtgctcaggctcttgtggatct 

+ + + + bu 

cggcggtgtagatggcggcgcggtgcaggcgcttgctgctcacg 

+ + + + 104 
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19) oli 19, 1534-1636, horn ARN 

CATCGAGCACCAGGTGAAGGACGTGCTGACCCCCCTGTTCAAGATCATCGGCGACGAGGT 

+ + + + + + 60 

GGGCCTGCGCACCCCCCAGCGCTTCACCGACCTGGTGAAGTTC 
, + + + 103 



20> oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 
+ + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 



[SEQ ID NOS: 69-77 respectively] 
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Fig. 39 : Sequence of the 11 oligonucleotides from which PCR fragment C was 
generated. 

20) oli 20 FmuvHmv, 1622-1718, inv corap ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

+ + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 

. + + 97 



21) oli 21, FmuvHmv, 1701-1799, horn ARN 

GCATCAACCCCCCCGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCG 
+ + 

CCGAGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGC 

, + + 99 



+ + + + 60 



22) oli 22,1784-1888, inv comp 

CATGTTGCTGAACTGGCCCCGGATGGTGGTGGGGCCGCTGCAGTTGCCCTTGCTCACGGC 

+ + + + + + 6° 

CAGGAACTGGTTGGTGGTGCGGGTCTCCAGCAGGGTGCTGTTCAC 
+ + + + 105 

23) oli 23, 1874-1971, horn ARN 

CAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACCTGGGCCGGGGCTACAACGTG 

+ + + + + + 60 

AGCAGCATCGTGACCATGACCAGCCAGGGCATGTACGG 
+ + + 98 

24) oli 24, 1957-2057, inv. comp ARN 

CCACCTCGAACACGCGGTACATGCTCAGCTGGCTCAGCTCGCTCCGCTTGCTGCTCAGGT 

+ + + + + + 60 

TGGGCTTCTCCACCAGGTAGGTGCCGCCGTACATGCCCTGG 
+ + + +- 101 

25) oli 25, FmuvHmv, 2043-2140, homARN 

GCGTGTTCGAGGTGGGCGTGATCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGA 

+ + + + + + 60 

CCAACTACCTGGAGCAGCCCGTGAGCAACGACCTGAGC 
+ + + 98 

2 6) oli2G," FmuvHmv,- 2125-2227 , j inv co.mpARN 

GCCGCTGCCCTGGTAGGGGATGGTGATGCTGTCCTCGGCGTGGCACAGGGCGGCCAGCTT 

. I | \ \- 60 

CAGCTCGCCCAGGGCCACCATGCAGTTGCTCAGGTCGTTGCTC 
.„_+ + + + 103 

27) oli 27, 2212-2309,. FmuvHm,: horn ARN 

CTACCAGGGCAGCGGGAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTGTGGAAGAG 

. i „_ _j h 60 

CCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCG 
+ + + 98 
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+ + + + 

CAGGCGGTCGATCACGGGGTCGTCGGTGCTCAGGGGCAC 
+ + + 99 

oos «H ?Q pmuv Hmv, 2377-2477, horn ARN 

GTGGGCCGTGCCCACCAC^CGCACCGACGACAAGCTGCGCATGGAGACCTGCTTCCAGCA 

+ + + + + " 

GGCCTGCAAGGGCAAGATCCAGGCCCTGTGCGAGAACCCCG 
+ + + + " 101 

m\ nii ^0 FmuvHmv 2462-2561, inv comp 

tgat?^cagctSaSStcaggctcaggtccacgctcagcac^ go 



-+- 



-+ +- 



GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 

, + + + 100 



[SEQ ID NOS: 76-88 respecively] 
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Fig. 40 : Sequence of the 8 oligonucleotides from which PCR fragment D was 
generated. 

30) oli 30, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 

+ + + + + + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 

+ + + + 100 



31) oli31, FmuvHmv, 2546-2649, horn ARN 

GTGGAGCTGAAGATCAAGATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGC 

+ 4- + + + + 60 

ATGGACCTGTACAAGAGCAACCACAACAACGTGTACTGGCTGAC 

, + + + 104 



32) oli 32, FmuvHmv, 2635-2738, inv comp ARN 

CGGTGAACAGGTAGGGGCTCACCTTGAAGCGGGGaATCCACTCCAGGGTGTTGATCACGC 

+ + 4- + + 4- 60 

CCAGGGCCAGGTTCTTCATGGGGGGGATGGTCAGCCAGTACACG 
. + 4- 4- 104 



33) oli 33, FmuvHmv, 2723-2827, horn ARN 

CCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGACTGCCACGCCCCGACCTAC 

+ + 4- + + + 60 

CTGCCCGCCGAGGTGGACGGCGACGTGAAGCTGAGCAGCAACCTG 
+ + 4- 4- 105 



34) oli 34, FMUVHmv, 2813-2911, inv comp ARN 

CACGTAGTACACCACGGCGTGCTCCACGCGGCTGGTGTCGTAGGTGGCCAGCACGTACTG 

+ + 4- + + + 60 

CAGGTCCTGGCCGGGGAGGATCACCAGGTTGCTGCTCAG 
4- + 4- 99 



35) oli 35 TMUHM, 2897-2995, homARN 

GTGGTGTACTACGTGTACAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTG 

: _ + + 4- + + + 60 

GCCATCAAGGGCGTGCCCATCGAGCTGCAGGTGGAGTGC 
4- 4- 4- 99 



36) oli 36, FmuvHmv, 2981-3078, inv comp ARN 

CCGCTGTGGGTGATGTGGCCGCCGCTCTCGCTGTCGGCCAGCACGCAGAAGTGGCGGCAC 
+ + + + + + 60 

CACAGCTTCTGGTCCCAGGTGAAGCACTCCACCTGGAG 
4- 4- 4- 98 
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37) oli 37, 3064-3147, homARN 

CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCCGCGAGGACGG 

+ + + + + + 60 

CACCAACCGCCGCTAGcgaattcc 
+ + 84 



[SEQ ID NOS: 89-96 respectively] 
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Fig. 41 : Construction of pEE14F MuV hum HN MV hum 
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Figure 42 A : Humanised nucleic acids sequence of F^yHj^v (upper 
sequence) compared to the original F MuV H MV sequence (lower sequence) and 
the corresponding amino acids sequence. 



1 4 ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 

| | M I ! M II M II M IN I M I I I I M I I I I I M I 
1 ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 

6 4 CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 

|| || | M | I I I I I I III MINI! It I I i I I M i I I I I M I 
5 1 TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 



63 
50 
113 
100 



114 AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 163 

| | | | | I I I I I I Mill M II M I M I I I I I I I I I M 

AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 150 



101 

■ • 

164 GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 213 

| | | | | | | | | || II Mill IMM II M II I I I I M M M 
151 GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 

214 GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 

| | t M II II Mill MUM MM M I II I M M M I 
GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 



201 
264 
251 
314 



CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 

| | | I I I I I I I I M I I M I I M II I I I M I I IMM 
CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 



200 
263 
250 
313 
300 
363 



AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 

II | | I 11 II I I I M M I I I I I M I I i IMM II M M II 
301 AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 350 

3 64 eGCCGCCGAGGTGACCGeCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 413 

| M II I I I I I M I M II II I 1 I" M M M, M I M III 
351 -CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 4 00 

4 ] 4 CCGGCGCCATCGGCGGCATGAAGAACTCCATCCAGGCCACCAACGGCGCC 4 63 

i M M II II M Mill M II Ml Mill II II \ I M 
401 CACGT.GCAATAGGAGCGATGAAAAATTCAATACAGGCAACTAATCGGGGA 4 50 

4 64 GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 513 

1 1 i m ii it 1 1 1 M r i ii m 1 1 1 i ii i ii ii i r i i 1 1 1 1 rtn 

4 51 G TC T-T C.G AAGT GAAG G AAGGC AC C CAAC AGT TAGCT AT AGC GGT AC AAG C 50 0 

514 CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 563 

MUM I M I 1 Ml II II M I I II II I I M I I I M M I MM 
501 CAT C CAAG AG CAT AT C AAT AC TAT TAT G AAC AC CC AAT TGAAC AAT AT G T 550 
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564 CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 613 

| || | | | I I I I I II I I I I I I I I II I I 1 I I I 1 I II I I M I I 
551 CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 600 

614 ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtC 6 63 

Mil | I I I I I I I II I I I I I I I I I I I I I I M I I II 
601 ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 650 

664 cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 713 

| | | | | | | I | I I II I I 1 I II I I I I I ( I I I I I I I I I I I 
651 ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 70 0 

714 CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 7 63 

I I 1 I II I I I I I I I I I I I I I I ! II I I I I I I 

7 01 CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 7 50 

7 64 AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 813 

|| Mill II I I I I I I I I MM! II If II II I II M I I I I I 

7 51 AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 8 00 

814 GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 8 63 

I I II I I I II M I I I I I I II II II II II I I I M II II II I 

8 01 GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 8 50 

8 64 CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 913 

I II I f II I II II II I I I M M II II I i I I M II 

8 51 CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 900 

914 TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 963 

Mill M II I I M III I III II II I M I II M M M II 
901 TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 9 50 

9 64 GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 1013 

|| M II II II I I II I II III MM I II II II M I M II I I 
9 51 GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 100 0 

1014 AGTACAAC GAGGCCGAGCGGCTGAGCCTGGAGACCAA'GCTGTGCCTGGUC "T0'b3 

I M.I || Mill III I I M I I II I I .11.1111 M Mill II 
1001 AATACAAT GAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 1050 

. • • • " 

.1064 GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 1113 

Mill II II II I M I II M I I I- II II- II M II Ml I 
10 51 GGGAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 1100 

1114 CCG'CTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 1163 

II II I I II I I M I \ I M II 1 I II IM I I II M M I 
1101 GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 1150 

1164 - CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC- 1213 

I II II I I I II I II II II II II II II I M I I Mill II 
1151 CGTGTCTATGCAAGAGTCCATCTTATCCTATATAGCAACCTGACCATCAT 1200 
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1214 GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 1263 

| | | | I I I I I I 1 I I I I I I I t I I I I I 1 t I I 1 I I I I M I 

1201 GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 1250 

12 64 CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 1313 

DIM | | I I I I I [ II II II I I I I I I I I I t I I I I I 

12 51 ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 13 00 

1314 TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 13 63 

| | | | | M I I I II I I I I I I I I I I I II I I I I I E I 

1301 TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 1350 

1364 ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 1413 

|| | | | i | | I I I I I I I II II Mill M I I Mill II Mill 

1351 ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 14 0 0 

1414 CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 14 63 

M II I I II I I I I II I M II I I II II 

14 01 AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 1450 

14 64 TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 1513 

| M M M I M II II M I II I I M M M II M I M II I I I M I 

14 51 TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 1500 

1514 AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 15 63 

M || M M M I M I I M II M M II I I I I M I I II I I II I II 

1501 AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 1550 

15 64 CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 1613 

|| II I II I I II I M I I I II M M M M I I I I M M Ml 

1551 ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 1600 

1614 GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 1663 

| MMI IMM I I I I I I M I I I M I II I II I J M I I I I I M 

1601 GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 1650 

%&54 C~ C OG ACGGOoA GT-AGSACT T-CC-G GGACC T-SACCT-GGT-GCAT-GAAG C G GCC 171 2 

M | I I I II II M I I I I I I I I M I I I II II Mil M II M 

1651 CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 1700 

1714 CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 17 63 

I II I Mill 'I I II II" M M I 1:1 II M t I IMM II I 

1701 AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 1750 

1 7 64 - AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 1813 

I I M II I Ml I I I I I I M II I I 1 M M I M I I II I M 

1751 AAG AG CT C AT GAAT GC AT TGGT GAAC T CAA CT CTAC T GGAGACC AGAAGA 1800 

1814 ACCAACCAGTTCCTGGCGGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 1863 

I II I I M I II I I I M I I I M II I II I I I I I I I I I I I I qca 

1801 ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 18 50 
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1864 
1851 
1914 



CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 

||[ | || || III Mill I I I I I I I I I I Mi I M 1 

AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 



1913 
1900 
1963 



TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 

| || II M 11111 II I I I I I I I I I I I I I I I I I I 

1901 TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 1950 



1964 
1951 
2014 



ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 

I I | I I || || II I I I I I I I I I I I I I I I I I M I II I II I I I I 
ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 



2013 
2000 
2063 



GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 

| Ml M II M I I I I I I M I I I I I II I I M M M I I I 
2001 GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 2050 



2 064 TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 2113 

|| | || || II I I I I II II I I I I I ! I I Mill I I I I I II 
2 0 51 TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 



2100 



2114 GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 2163 

| | | | | M M M || I 1 1 I I I I I I I I I I I I I I I I I I I M t I 
GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 



2101 
2164 
2151 



GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 

Ml M || || I I 11 I II I I I I I II M I I I I I II I > * I 

GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 



2150 
2213 
2200 



2214 ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 2263 

I | | | I I I I II M M I I II I I I I I 1 I II INN II II 

2201 ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 2250 

22 64 TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 2313 

Mill I II I I I I I I I I I I I I I I I I I I I I I I I I I II 

2251 TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 2300 

2314 CCCCGT.GATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGl'GATCGCCG 23*3 

II I I I I I 1111 II I II M I I I .1 I I I I I I M I I 

2301 TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 2350 

23 61 AGAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 2413 

1 1 1 1 1 1 1 M ii M 1 1 1 ii n 1 1 M ii ii N i M 1 1 1 ii 

2351 ACAACCAAGCAAAATGGGGTGTCCCGACAACACGAACAGATGACAAGTTG 2400 

2414 CGCATGGAGACCTGCTTCCAGCAGGCCTGGAAGGGCAAGATCCAGGCCCT 2463 

II I I I I I II I I I I I I I I I I I I I I II I IN I M MIN M M 

24 01 CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 2450 

24 64 GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 2513 

I I I I I I I I III I I I I M I I.I I I I I M I I I I I I I I I I ocnn 

24 51 CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 2500 
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2514 ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 2 563 

| | | | ! | |[ I I I I I ! I I I I I I I I I I I I I I I I I I I I i I 
2 501 ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 2 5 50 

2564 ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 2 613 

M | | M I I I 1 I I I I I I [ I I I t I I I I MINIMI! 

2551 ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 2 600 

• • • 

2 614 GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 2 663 

| | I I I | I I I I I I I I I I I I I I I I I I I I I I I I II I II I 

2 601 ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 2650 

• • * * 

2 6 64 AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 2713 

| | | | || | || I I M M M M I M ! MM II 1 M t 

2 651 AGAACCTAGCCTTAGGTGTAAT CAACACATTGGAGTGGATACCGAGATTC 2700 

2714 AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 2763 

I I I I I I I M M M M M M I I I M M M M I M M M I I I 
2 701 AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 2 7 50 

2 7 64 CTGCCACGCCCCGACCTACCTGCCCGC CG AGGTGGACGGCGACGTGAAGC 2813 

M | | | | | | | | | II 1 M M II II M I I I I I I I I I I M II I 

27 51 CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 2 8 00 

2814 TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 2 8 63 

III M I M II I II I M II II M I I I I I I I I I I M 

28 01 TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 2 8 50 

2 8 64 GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 2913 

M MINIM II I I i I II N II Mill II Mill M 
28 51 GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 2900 

2914 CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 2963 

M I i M II I I I I I I I I I I I I I M I I II I I I II I I I 
2901 CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 2 950 

"2^"6 J 4 'A'GGGCt^XXraTCTK^ 3313 

I M I II I Mill II I It .Mill I III I M I l-l I I I Ml I I 

2951 AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 3000 

3014 CTGTGGTGCCGCCACTTCTGGGTGCTGGCCGACAGCGAGAGCGGCGGCCA 3063 

II "MINIM I I I I I IN I I I I I II ill II II II I I 
3001 CTCTGGTGCCGTCACTTCTGTGTGGTTGCGGACTCAGAATCTGGTGGACA 3 050 

3064 CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 3113 

Mill III I I I I II III I N II I I II II M I II I I I MM 
3051 TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 3100 

3114 GCGAGGACGGCACCAACCGCCGCTAG 3139 

I II II I I -I I II I I I I I III 
3101 GGGAAGATGGAACCAATCGGAGATAG -312 6 
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Fig- 42B : F MU H MV .seq check: 4381 from: 1 to: 3126 
nucleic acid sequence of F MU H MV (non humanised) 

ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 

T AT ATGT GT GAAT AT CAATATCT T GC AGC AAATTGGAT ACATC AAGCAAC 
AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 
GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 
GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 
CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 
AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 
CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 
CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 
GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 
cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 
CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 
ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 
ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 
CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 
AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 
GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 
• CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 
TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 
GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCAGATATTCTGCC 
AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 
GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 
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GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA . 

CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 

GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 

ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 

TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 

ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 

AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 

TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 

AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 

ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 

GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 

CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 

AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 

AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 

ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 

AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 

TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 

•-ATio^ATSGGGGi^C-T^ 
GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 

TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 

GAGCAACCAGTCAGTAATGATCTCAGGAACTGTATGGTGGCTTTGGGGGA 

GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 

ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 

.TGGAAATCCCCAACCGACATGCAATCCTGGGtCCCCTTATCAACGGATGA 
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TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 
ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 
CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 
CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 
ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 
ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 
ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 
AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 
AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 
CTGCCATGCCCC7\ACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 
TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 
GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 
CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 
AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 
CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 
TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 
GGGAAGATGGAACCAATCGCAGATAG 
tSEQTDT\iO:'97j 
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Fig 42C: F MUV humH M hum.seq check: 5778 from: 14 to: 3139 
Humanised nucleic acids sequence of F MuV H Mv 

ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 
CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 
AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 
GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 
GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 
CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 
AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 
CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 
CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 
GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 
CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 
CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 
ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtc 
cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 
CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 

' 7^GCGCUGGCUT''GAT"GGA"GGGCCA^ 

GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 
CCCTGGTGATCGACTTGTACAGCATCAGGAGCTTCATGAACAACCAGGAG 
TCCATCATCCAGCTGGCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 
GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 
AGTACAAGGAGGCCGAGCGGCTGAGGCTGGAGACCAAGCTGTGCCTGGCC 
GGCAACATCAGCGCCTGGGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 
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CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 
CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 
GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 
CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 
TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 
ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 
CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 
TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 
AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 
CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 
GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 
CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 
CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 
AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 
ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 
CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 
TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 
ATGTACGGCGGCACCTACCTGCTGGAGT^GCCCAATXrTGTC'CKGCAAGC'G 
GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 
TCGGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 
GAGGAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 
GCtfGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 
ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 
TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 
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CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 
ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 
CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 
GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 
ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 
ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 
GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 
AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 
AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 
CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 
TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 
GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 
CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 
AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 
CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 
CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 
GCGAGGACGGCACCAACCGCCGCTAG 
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